Enhancing Seasonal Influenza Surveillance: Topic Analysis of Widely Used Medicinal Drugs Using Twitter Data.

作者: Ireneus Kagashe , Zhijun Yan , Imran Suheryani

DOI: 10.2196/JMIR.7393

关键词: Survey methodologyData miningLatent Dirichlet allocationOseltamivirDiseasePublic health surveillanceSocial mediaSocial desirability biasMedicineEnvironmental healthFlu season

摘要: Background: Uptake of medicinal drugs (preventive or treatment) is among the approaches used to control disease outbreaks, and therefore, it vital importance be aware counts frequencies most commonly trending topics about these from consumers for successful implementation measures. Traditional survey methods would have accomplished this study, but they are too costly in terms resources needed, subject social desirability bias discovery. Hence, there a need use alternative efficient means such as Twitter data machine learning (ML) techniques. Objective: Using data, aim study was (1) provide methodological extension efficiently extracting widely consumed during seasonal influenza (2) extract tweets infer how insights provided by can enhance surveillance. Methods: From collected 2012-13 flu season, we first identified with mentions then constructed an ML classifier using dependency words features. The that evidenced consumption drugs, out which mostly drugs. Finally, extracted each drugs’ latent Dirichlet allocation (LDA). Results: Our proposed obtained F1 score 0.82, significantly outperformed two benchmark classifiers (ie, P<.001 lexicon-based P=.048 1-gram term frequency [TF]). 40,428 50,828 were virus vaccines had around 76.95% (31,111/40,428) share total; other notable Theraflu, DayQuil, NyQuil, vitamins, acetaminophen, oseltamivir. exhibited common themes experiences people who Among enabling deterrent factors uptake, keys mitigating severity outbreaks. Conclusions: results showed feasibility surveillance lieu traditional conventional approaches. Public health officials stakeholders benefit findings especially enhancing strategies extended outbreaks diseases. [J Med Internet Res 2017;19(9):e315]

参考文章(61)
Xiang Ji, Soon Ae Chun, Zhi Wei, James Geller, Twitter sentiment classification for measuring public health concerns Social Network Analysis and Mining. ,vol. 5, pp. 13- ,(2015) , 10.1007/S13278-015-0253-5
Rachel Lynn Kendra, Suman Karki, Jesse Lee Eickholt, Lisa Gandy, Characterizing the Discussion of Antibiotics in the Twittersphere: What is the Bigger Picture? Journal of Medical Internet Research. ,vol. 17, ,(2015) , 10.2196/JMIR.4220
Lisa M. Lee, Steven M. Teutsch, Stephen B. Thacker, Michael E. St. Louis, Principles & Practice of Public Health Surveillance Oxford University Press. ,(2010) , 10.1093/ACPROF:OSO/9780195372922.001.0001
Heather Cole-Lewis, Arun Varghese, Amy Sanders, Mary Schwarz, Jillian Pugatch, Erik Augustson, Assessing Electronic Cigarette-Related Tweets for Sentiment and Content Using Supervised Machine Learning Journal of Medical Internet Research. ,vol. 17, ,(2015) , 10.2196/JMIR.4392
Alan R. Aronson, Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program american medical informatics association annual symposium. pp. 17- 21 ,(2001)
Lisa M Gargano, Natasha L Underwood, Jessica M Sales, Katherine Seib, Christopher Morfaw, Dennis Murray, Ralph J DiClemente, James M Hughes, Influence of sources of information about influenza vaccine on parental attitudes and adolescent vaccine receipt Human Vaccines & Immunotherapeutics. ,vol. 11, pp. 1641- 1647 ,(2015) , 10.1080/21645515.2015.1038445
Zhijun Yin, Daniel Fabbri, S Trent Rosenbloom, Bradley Malin, A Scalable Framework to Detect Personal Health Mentions on Twitter Journal of Medical Internet Research. ,vol. 17, ,(2015) , 10.2196/JMIR.4305
Raminta Daniulaityte, Ramzi W Nahhas, Sanjaya Wijeratne, Robert G Carlson, Francois R Lamy, Silvia S Martins, Edward W Boyer, G Alan Smith, Amit Sheth, None, “Time for dabs”: Analyzing Twitter data on marijuana concentrates across the U.S. Drug and Alcohol Dependence. ,vol. 155, pp. 307- 311 ,(2015) , 10.1016/J.DRUGALCDEP.2015.07.1199
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Christopher Weeg, H Andrew Schwartz, Shawndra Hill, Raina M Merchant, Catalina Arango, Lyle Ungar, Using Twitter to Measure Public Discussion of Diseases: A Case Study JMIR public health and surveillance. ,vol. 1, ,(2015) , 10.2196/PUBLICHEALTH.3953