nEmesis: Which Restaurants Should You Avoid Today?

Authors: Vincent Silenzio, Henry A. Kautz, Adam Sadilek, Sean Padraig Brennan

Abstract: Computational approaches to health monitoring and epidemiology continue to evolve rapidly. We present an end-to-end system, nEmesis, that automatically identifies restaurants posing public health risks. Leveraging a language model of Twitter users' online communication, nEmesis finds individuals who are likely suffering from a foodborne illness. People's visits to restaurants are modeled by matching GPS data embedded in the messages with restaurant addresses. As a result, we can assign each venue a "health score" based on the proportion of customers that fell ill shortly after visiting it. Statistical analysis reveals that our inferred health score correlates (r = 0.30) with the official inspection data from the Department of Health and Mental Hygiene (DOHMH). We investigate the joint associations of multiple factors mined from online data with the DOHMH violation scores and find that over 23% of variance can be explained by our factors. We demonstrate that readily accessible online data can be used to detect cases of foodborne illness in a timely manner. This approach offers an inexpensive way to enhance current methods to monitor food safety (e.g., adaptive inspections) and identify potentially problematic venues in near-real time.
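
The per-venue "health score" described in the abstract can be illustrated with a minimal sketch. It assumes the GPS-to-restaurant matching and the language-model classification have already been done, so the inputs are simply visit records and illness reports. The function name `health_scores`, the record layouts, and the 72-hour window are hypothetical choices for illustration; the abstract does not specify them.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def health_scores(visits, illness_reports, window=timedelta(hours=72)):
    """Return {venue: fraction of distinct visitors who reported illness
    within `window` after a visit}.

    visits: iterable of (user, venue, visit_time) tuples (assumed layout).
    illness_reports: iterable of (user, report_time) tuples (assumed layout).
    window: assumed follow-up period; not taken from the paper.
    """
    # Index illness reports by user for quick lookup.
    reports_by_user = defaultdict(list)
    for user, report_time in illness_reports:
        reports_by_user[user].append(report_time)

    visitors = defaultdict(set)  # venue -> all users seen there
    sick = defaultdict(set)      # venue -> users ill shortly after a visit
    for user, venue, visit_time in visits:
        visitors[venue].add(user)
        for report_time in reports_by_user.get(user, ()):
            if timedelta(0) <= report_time - visit_time <= window:
                sick[venue].add(user)
                break

    return {venue: len(sick[venue]) / len(users)
            for venue, users in visitors.items()}


if __name__ == "__main__":
    t0 = datetime(2013, 1, 1, 12, 0)
    visits = [("a", "v1", t0), ("b", "v1", t0), ("c", "v2", t0)]
    reports = [("a", t0 + timedelta(days=1)), ("c", t0 + timedelta(days=10))]
    print(health_scores(visits, reports))
```

Under this definition a higher value means a larger share of visitors fell ill soon after visiting, so the venue would rank as more problematic.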
