Frequent Patterns Based Word Network: What Can We Obtain from the Tourism Blogs?

作者: Hua Yuan , Lei Guo , Hualin Xu , Yong Xiang

DOI: 10.1007/978-3-642-39787-5_2

关键词:

摘要: In this work, we present a method to extract interesting information for specific reader from massive tourism blog data. To end, first introduce the web crawler tool obtain contents and divide them into semantic word segments. Then, use frequent pattern mining discover useful 1- 2-itemset between words after necessary data cleaning. Third, visualize all correlations with network. Finally, propose local search based on max-confidence measurement that enables readers specify an topic find relevant contents. We illustrate benefits of approach by applying it Chinese online dataset.

参考文章(27)
Nitin Indurkhya, Fred J Damerau, None, Handbook of Natural Language Processing Chapman & Hall/CRC. ,(2010) , 10.1201/9781420085938
Fei Wang, Yunfang Wu, Mining market trend from blog titles based on lexical semantic similarity international conference on computational linguistics. pp. 261- 273 ,(2012) , 10.1007/978-3-642-28601-8_22
Vibhu Mittal, Mayur Datar, Hang Cui, Comparative experiments on sentiment classification for online product reviews national conference on artificial intelligence. pp. 1265- 1270 ,(2006)
Giuseppe Attardi, Maria Simi, Blog Mining Through Opinionated Words text retrieval conference. ,(2006)
Yang Liu, Xiaohui Yu, Xiangji Huang, Aijun An, Blog Data Mining: The Predictive Power of Sentiments Data Mining for Business Applications. pp. 183- 195 ,(2009) , 10.1007/978-0-387-79420-4_13
Xi Bai, Jigui Sun, Haiyan Che, Jin Wang, Towards knowledge extraction from weblogs and rule-based semantic querying rules and rule markup languages for the semantic web. pp. 215- 223 ,(2007) , 10.1007/978-3-540-75975-1_21
Mohsen Jafari Asbagh, Mohsen Sayyadi, Hassan Abolhassani, Blog Summarization for Blog Mining software engineering, artificial intelligence, networking and parallel/distributed computing. pp. 157- 167 ,(2009) , 10.1007/978-3-642-01203-7_13
Qing Cao, Wenjing Duan, Qiwei Gan, Exploring determinants of voting for the helpfulness of online user reviews: A text mining approach decision support systems. ,vol. 50, pp. 511- 521 ,(2011) , 10.1016/J.DSS.2010.11.009
Tianyi Wu, Yuguo Chen, Jiawei Han, Re-examination of interestingness measures in pattern mining: a unified framework Data Mining and Knowledge Discovery. ,vol. 21, pp. 371- 397 ,(2010) , 10.1007/S10618-009-0161-2