Frequent Patterns Based Word Network: What Can We Obtain from the Tourism Blogs?

作者： Hua Yuan , Lei Guo , Hualin Xu , Yong Xiang

关键词:

摘要: In this work, we present a method to extract interesting information for specific reader from massive tourism blog data. To end, first introduce the web crawler tool obtain contents and divide them into semantic word segments. Then, use frequent pattern mining discover useful 1- 2-itemset between words after necessary data cleaning. Third, visualize all correlations with network. Finally, propose local search based on max-confidence measurement that enables readers specify an topic find relevant contents. We illustrate benefits of approach by applying it Chinese online dataset.

参考文章(27)

Nitin Indurkhya, Fred J Damerau, None, Handbook of Natural Language Processing Chapman & Hall/CRC. ,(2010) , 10.1201/9781420085938

Fei Wang, Yunfang Wu, Mining market trend from blog titles based on lexical semantic similarity international conference on computational linguistics. pp. 261- 273 ,(2012) , 10.1007/978-3-642-28601-8_22

Vibhu Mittal, Mayur Datar, Hang Cui, Comparative experiments on sentiment classification for online product reviews national conference on artificial intelligence. pp. 1265- 1270 ,(2006)

Giuseppe Attardi, Maria Simi, Blog Mining Through Opinionated Words text retrieval conference. ,(2006)

Yang Liu, Xiaohui Yu, Xiangji Huang, Aijun An, Blog Data Mining: The Predictive Power of Sentiments Data Mining for Business Applications. pp. 183- 195 ,(2009) , 10.1007/978-0-387-79420-4_13

Xi Bai, Jigui Sun, Haiyan Che, Jin Wang, Towards knowledge extraction from weblogs and rule-based semantic querying rules and rule markup languages for the semantic web. pp. 215- 223 ,(2007) , 10.1007/978-3-540-75975-1_21

Alexander F. Gelbukh, Computational Linguistics and Intelligent Text Processing ,(2001)

Mohsen Jafari Asbagh, Mohsen Sayyadi, Hassan Abolhassani, Blog Summarization for Blog Mining software engineering, artificial intelligence, networking and parallel/distributed computing. pp. 157- 167 ,(2009) , 10.1007/978-3-642-01203-7_13

Qing Cao, Wenjing Duan, Qiwei Gan, Exploring determinants of voting for the helpfulness of online user reviews: A text mining approach decision support systems. ,vol. 50, pp. 511- 521 ,(2011) , 10.1016/J.DSS.2010.11.009

10.

Tianyi Wu, Yuguo Chen, Jiawei Han, Re-examination of interestingness measures in pattern mining: a unified framework Data Mining and Knowledge Discovery. ,vol. 21, pp. 371- 397 ,(2010) , 10.1007/S10618-009-0161-2

Frequent Patterns Based Word Network: What Can We Obtain from the Tourism Blogs?

来源期刊

我的账户

Frequent Patterns Based Word Network: What Can We Obtain from the Tourism Blogs?

来源期刊

相似文章 4

Travel Motivations of Domestic Film Tourists to the Hengdian World Studios: Serendipity, Traverse, and Mimicry

Mining graphs from travel blogs: a review in the context of tour planning

Framework of blog data based multi-criteria weighted points of interest graph for trip planning

Spatial information extraction from travel narratives: Analysing the notion of co-occurrence indicating closeness of tourist places:

我的账户