Interpretation and trust

作者: Jason Chuang , Daniel Ramage , Christopher Manning , Jeffrey Heer

DOI: 10.1145/2207676.2207738

关键词:

摘要: Statistical topic models can help analysts discover patterns in large text corpora by identifying recurring sets of words and enabling exploration topical concepts. However, understanding validating the output these itself be a challenging analysis task. In this paper, we offer two design considerations - interpretation trust for designing visualizations based on data-driven models. Interpretation refers to facility with which an analyst makes inferences about data through lens model abstraction. Trust actual perceived accuracy analyst's inferences. These derive from our experiences developing Stanford Dissertation Browser, tool exploring over 9,000 Ph.D. theses similarity, subsequent review existing literature. We contribute novel similarity measure collections notion "word-borrowing" that arose iterative process. Based literature review, distill set recommendations describe how they promote interpretable trustworthy visual tools.

参考文章(54)
James J Thomas Kristin A Cook, None, Illuminating the Path: The Research and Development Agenda for Visual Analytics United States. Department of Homeland Security. ,(2005)
Pamela E. Sandstrom, Scholarly communication as a socioecological system Scientometrics. ,vol. 51, pp. 573- 605 ,(2001) , 10.1023/A:1019655305286
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Daniel Ramage, David Hall, Ramesh Nallapati, Christopher D. Manning, Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora empirical methods in natural language processing. pp. 248- 256 ,(2009) , 10.3115/1699510.1699543
H. P. Luhn, The automatic creation of literature abstracts Ibm Journal of Research and Development. ,vol. 2, pp. 159- 165 ,(1958) , 10.1147/RD.22.0159
James Sinclair, Michael Cardew-Hall, The folksonomy tag cloud: when is it useful? Journal of Information Science. ,vol. 34, pp. 15- 29 ,(2008) , 10.1177/0165551506078083
T. L. Griffiths, M. Steyvers, Finding scientific topics Proceedings of the National Academy of Sciences of the United States of America. ,vol. 101, pp. 5228- 5235 ,(2004) , 10.1073/PNAS.0307752101
Jerry Alan Fails, Dan R. Olsen, Interactive machine learning intelligent user interfaces. pp. 39- 45 ,(2003) , 10.1145/604045.604056
Douglass R. Cutting, David R. Karger, Jan O. Pedersen, Constant interaction-time scatter/gather browsing of very large document collections international acm sigir conference on research and development in information retrieval. pp. 126- 134 ,(1993) , 10.1145/160688.160706
Weimao Ke, Cassidy R. Sugimoto, Javed Mostafa, Dynamicity vs. effectiveness: studying online clustering for scatter/gather international acm sigir conference on research and development in information retrieval. pp. 19- 26 ,(2009) , 10.1145/1571941.1571947