TY - GEN
T1 - Public sharing of medical advice using social media
T2 - 17th International Conference on Grey Literature: A New Wave of Textual and Non-Textual Grey Literature, GL 2016
AU - Leroy, Gondy
AU - Harber, Philip
AU - Revere, Debra
PY - 2016
Y1 - 2016
N2 - Introduction: Social media tools, such as Facebook®, Twitter™, blogs and online communities, are increasingly utilized for networking and to distribute information in medicine and public health. Participation in these media has increased sharply over the past decade. Six years ago, Twitter did not exist, yet now an estimated 15% of the world population subscribes to Twitter. This has created a large-scale, complex, and unindexed publicly available data source. Goal: We sought to understand the richness and novelty of health-related tweets by analyzing the characteristics of health information-focused tweets using automated and manual analysis. Research methods: Utilizing the Twitter Search application programming interface (API), we retrieved two sets of English-language tweets using keywords related to asthma (#asthma and asthma). Tweets were categorized by the assumed source (retweeted by a person, sent by an organization, originated by an individual) and content (containing medication, symptoms, triggers, a combination, or none of these) using natural language processing. Regarding tweet source, we assumed that tweets retweeted to a person (i.e., @username) were sent by an individual; those not retweeted that contained a URL were sent by an organization; and those tweets remaining were original content tweeted by an individual. Regarding content categorization, we used lexicons containing terms for asthma medication, symptoms, and five different types of asthma triggers (activities, air pollutants, allergens, environmental and irritants). In addition, we conducted content analysis using a combined text mining and manual approach. Applying association rule mining to the tweets, we generated an overview of the most frequent combinations of terms presented as if-then rules. The manual, in-depth analysis evaluated a random sample of 200 tweets for originality, content, credibility and relevance.
Costs: The costs associated with this project were the time needed to process tweets. While over 500 million tweets are generated daily, the cost of this information distribution is shared among millions of Twitter subscribers. Results: The analysis showed that the majority of tweets contain URLs and many are retweeted. The proportion of tweets containing personal, new content is small. The majority of tweets are sent by organizations, both commercial and noncommercial, and the content consists of broad facts and statements. Both medication and environmental triggers are common topics. Conclusion: The high diversity in topics and terminology combined with the small proportion of personal tweets should be taken into account when using Twitter as a resource for tracking and discovering health behaviors or problems in the population. The large proportion of tweets referring to external information may make this a very useful tool for accessing grey literature and using the tweets as descriptors. Further research is needed to create comprehensive vocabularies and methods to efficiently label tweets.
AB - Introduction: Social media tools, such as Facebook®, Twitter™, blogs and online communities, are increasingly utilized for networking and to distribute information in medicine and public health. Participation in these media has increased sharply over the past decade. Six years ago, Twitter did not exist, yet now an estimated 15% of the world population subscribes to Twitter. This has created a large-scale, complex, and unindexed publicly available data source. Goal: We sought to understand the richness and novelty of health-related tweets by analyzing the characteristics of health information-focused tweets using automated and manual analysis. Research methods: Utilizing the Twitter Search application programming interface (API), we retrieved two sets of English-language tweets using keywords related to asthma (#asthma and asthma). Tweets were categorized by the assumed source (retweeted by a person, sent by an organization, originated by an individual) and content (containing medication, symptoms, triggers, a combination, or none of these) using natural language processing. Regarding tweet source, we assumed that tweets retweeted to a person (i.e., @username) were sent by an individual; those not retweeted that contained a URL were sent by an organization; and those tweets remaining were original content tweeted by an individual. Regarding content categorization, we used lexicons containing terms for asthma medication, symptoms, and five different types of asthma triggers (activities, air pollutants, allergens, environmental and irritants). In addition, we conducted content analysis using a combined text mining and manual approach. Applying association rule mining to the tweets, we generated an overview of the most frequent combinations of terms presented as if-then rules. The manual, in-depth analysis evaluated a random sample of 200 tweets for originality, content, credibility and relevance.
Costs: The costs associated with this project were the time needed to process tweets. While over 500 million tweets are generated daily, the cost of this information distribution is shared among millions of Twitter subscribers. Results: The analysis showed that the majority of tweets contain URLs and many are retweeted. The proportion of tweets containing personal, new content is small. The majority of tweets are sent by organizations, both commercial and noncommercial, and the content consists of broad facts and statements. Both medication and environmental triggers are common topics. Conclusion: The high diversity in topics and terminology combined with the small proportion of personal tweets should be taken into account when using Twitter as a resource for tracking and discovering health behaviors or problems in the population. The large proportion of tweets referring to external information may make this a very useful tool for accessing grey literature and using the tweets as descriptors. Further research is needed to create comprehensive vocabularies and methods to efficiently label tweets.
UR - http://www.scopus.com/inward/record.url?scp=85012934440&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85012934440&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85012934440
T3 - GL-Conference Series: Conference Proceedings
SP - 83
EP - 93
BT - 17th International Conference on Grey Literature
PB - TextRelease
Y2 - 1 December 2015 through 2 December 2015
ER -