Online Visual Analytics of Text Streams

Authors: Shixia Liu, Jialun Yin, Xiting Wang, Weiwei Cui, Kelei Cao

DOI: 10.1109/TVCG.2015.2509990

Abstract: We present an online visual analytics approach to helping users explore and understand hierarchical topic evolution in high-volume text streams. The key idea behind this approach is to identify representative topics in incoming documents and align them with the existing representative topics that they immediately follow (in time). To this end, we learn a set of streaming tree cuts from topic trees based on user-selected focus nodes. A dynamic Bayesian network model has been developed to derive the tree cuts in the incoming topic trees to balance the fitness of each tree cut and the smoothness between adjacent tree cuts. By connecting the corresponding topics at different times, we are able to provide an overview of the evolving hierarchical topics. A sedimentation-based visualization has been designed to enable the interactive analysis of streaming text data from global patterns to local details. We evaluated our method on real-world datasets and the results are generally favorable.
