Building the Multidimensional Semantic Index of Webpages for Facet Extraction

作者: Xiao Wei , Chenglei Qin , Zheng Xu

DOI: 10.4018/IJCINI.2015040101

关键词:

摘要: Faceted search is an efficient search method to use the big data and one of its key issues is to extract facets from unstructured webpages automatically. It is still a problem to extract facets from massive unstructured webpages exactly and automatically. To solve the problem, this paper first proposed a novel index structure of webpages, the Multidimensional Semantic Index (MDSI), which holds rich semantics and are helpful to extract facets. In MDSI, the differently dimensional semantic indexes are bridged by mining the semantic mapping between them. Then, an automatic facet extraction method is proposed by analysing semantic mapping relations in MDSI. At last, to validate the effect of the proposed method, two datasets are constructed and the experimental results show that the proposed method is feasible and comparatively precise.

参考文章(25)
Gary Marchionini, Ben Brunk, Towards a General Relation Browser: A GUI for Information Architects Journal of Digital Information. ,vol. 4, ,(2003)
Yingxu Wang, James A. Anderson, George Baciu, Gerhard Budin, D. Frank Hsu, Mitsuru Ishizuka, Witold Kinsner, Fumio Mizoguchi, Toyoaki Nishida, Kenji Sugawara, Shusaku Tsumoto, Du Zhang, Perspectives on eBrain and Cognitive Computing International Journal of Cognitive Informatics and Natural Intelligence. ,vol. 6, pp. 1- 21 ,(2012) , 10.4018/JCINI.2012100101
Zheng Xu, Xiangfeng Luo, Jie Yu, Weimin Xu, Measuring semantic similarity between words by removing noise and redundancy in web snippets Concurrency and Computation: Practice and Experience. ,vol. 23, pp. 2496- 2510 ,(2011) , 10.1002/CPE.1816
Xiao Wei, Xiangfeng Luo, Qing Li, Improving the Compression Efficiency for News Web Service Using Semantic Relations Among Webpages International Journal of Cognitive Informatics and Natural Intelligence. ,vol. 7, pp. 49- 64 ,(2013) , 10.4018/IJCINI.2013040104
Xiao Wei, Xiangfeng Luo, Qing Li, Jun Zhang, Zheng Xu, Online Comment-Based Hotel Quality Automatic Assessment Using Improved Fuzzy Comprehensive Evaluation and Fuzzy Cognitive Map IEEE Transactions on Fuzzy Systems. ,vol. 23, pp. 72- 84 ,(2015) , 10.1109/TFUZZ.2015.2390226
Yingxu Wang, George Baciu, Yiyu Yao, Witold Kinsner, Keith Chan, Bo Zhang, Stuart Hameroff, Ning Zhong, Chu-Ren Hunag, Ben Goertzel, Duoqian Miao, Kenji Sugawara, Guoyin Wang, Jane You, Du Zhang, Haibin Zhu, None, Perspectives on Cognitive Informatics and Cognitive Computing International Journal of Cognitive Informatics and Natural Intelligence. ,vol. 4, pp. 1- 29 ,(2010) , 10.4018/JCINI.2010010101
Junyu Xuan, Xiangfeng Luo, Shunxiang Zhang, Zheng Xu, Huimin Liu, Feiyue Ye, Building Hierarchical Keyword Level Association Link Networks for Web Events Semantic Analysis ieee international conference on dependable, autonomic and secure computing. pp. 987- 994 ,(2011) , 10.1109/DASC.2011.163
Yingxu Wang, Bernard Carlos Widrow, Bo Zhang, Witold Kinsner, Kenji Sugawara, Fuchun Sun, Jianhua Lu, Thomas Weise, Du Zhang, Perspectives on the Field of Cognitive Informatics and its Future Development International Journal of Cognitive Informatics and Natural Intelligence. ,vol. 5, pp. 1- 17 ,(2011) , 10.4018/JCINI.2011010101
Shunxiang Zhang, Xiangfeng Luo, Junyu Xuan, Xue Chen, Weimin Xu, Discovering small-world in association link networks for web-based learning Proceedings of the third international ACM workshop on Multimedia technologies for distance learning - MTDL '11. pp. 19- 24 ,(2011) , 10.1145/2072598.2072603