Computer-Assisted Text Analysis for Social Science: Topic Models and Beyond.

作者: Ryan Wesslen

DOI:

关键词:

摘要: Topic models are a family of statistical-based algorithms to summarize, explore and index large collections text documents. After decade research led by computer scientists, topic have spread social science as new generation data-driven scientists searched for tools unstructured text. Recently, contributed model literature with developments in causal inference handling the problem multi-modality. In this paper, I provide review on evolution modeling including extensions document covariates, methods evaluation interpretation, advances interactive visualizations along each aspect's relevance application research.

参考文章(68)
Aki Vehtari, David B. Dunson, Andrew Gelman, Hal S. Stern, Donald B. Rubin, John B. Carlin, Bayesian Data Analysis ,(1995)
Todd Kulesza, Margaret Burnett, Weng-Keen Wong, Simone Stumpf, Principles of Explanatory Debugging to Personalize Interactive Machine Learning intelligent user interfaces. pp. 126- 137 ,(2015) , 10.1145/2678025.2701399
Liangjie Hong, Brian D. Davison, Empirical study of topic modeling in Twitter Proceedings of the First Workshop on Social Media Analytics - SOMA '10. pp. 80- 88 ,(2010) , 10.1145/1964858.1964870
David M. Blei, John D. Lafferty, Dynamic topic models Proceedings of the 23rd international conference on Machine learning - ICML '06. pp. 113- 120 ,(2006) , 10.1145/1143844.1143859
Weiwei Cui, Shixia Liu, Zhuofeng Wu, Hao Wei, None, How Hierarchical Topics Evolve in Large Text Corpora. IEEE Transactions on Visualization and Computer Graphics. ,vol. 20, pp. 2281- 2290 ,(2014) , 10.1109/TVCG.2014.2346433
Michael J. Paul, Mark Dredze, Discovering Health Topics in Social Media Using Topic Models PLoS ONE. ,vol. 9, pp. e103408- ,(2014) , 10.1371/JOURNAL.PONE.0103408
Eric Alexander, Joe Kohlmann, Robin Valenza, Michael Witmore, Michael Gleicher, Serendip: Topic model-driven visual exploration of text corpora visual analytics science and technology. pp. 173- 182 ,(2014) , 10.1109/VAST.2014.7042493
Jason Chuang, Christopher D. Manning, Jeffrey Heer, Termite Proceedings of the International Working Conference on Advanced Visual Interfaces - AVI '12. pp. 74- 77 ,(2012) , 10.1145/2254556.2254572
Justin Grimmer, Brandon M. Stewart, Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts Political Analysis. ,vol. 21, pp. 267- 297 ,(2013) , 10.1093/PAN/MPS028
Margaret E. Roberts, Brandon M. Stewart, Dustin Tingley, Christopher Lucas, Jetson Leder-Luis, Shana Kushner Gadarian, Bethany Albertson, David G. Rand, Structural Topic Models for Open-Ended Survey Responses American Journal of Political Science. ,vol. 58, pp. 1064- 1082 ,(2014) , 10.1111/AJPS.12103