A Web Text Classification Rules Extraction Algorithm

作者: He Liu , Dayou Liu , Xiaohu Shi

DOI: 10.1109/ICNC.2008.231

关键词:

摘要: Text classification is a very important technique for gathering Web information. A novel approach based on multi-population collaborative optimization proposed the extraction of text rules. The information entropy was applied initialization populations and evolution populations. method to three benchmark test sets examine its effectiveness. Results show that precision higher those existing methods, cost computation less than methods. Furthermore, rules obtained by are simple compared with

参考文章(12)
Peter Clark, Robin Boswell, Rule induction with CN2: Some recent improvements Lecture Notes in Computer Science. pp. 151- 163 ,(1991) , 10.1007/BFB0017011
M. E. Maron, Automatic Indexing: An Experimental Inquiry Journal of the ACM. ,vol. 8, pp. 404- 417 ,(1961) , 10.1145/321075.321084
Sarah Zelikovitz, Haym Hirsh, Using LSI for text classification in the presence of background text Proceedings of the tenth international conference on Information and knowledge management - CIKM'01. pp. 113- 118 ,(2001) , 10.1145/502585.502605
Wai Lam, Chao Yang Ho, Using a generalized instance set for automatic text categorization international acm sigir conference on research and development in information retrieval. pp. 81- 89 ,(1998) , 10.1145/290941.290961
Robert E. Schapire, Yoram Singer, BoosTexter: A Boosting-based Systemfor Text Categorization Machine Learning. ,vol. 39, pp. 135- 168 ,(2000) , 10.1023/A:1007649029923
Jianhui Wang, A Simple and Efficient Algorithm to Classify a Large Scale of Texts Journal of Computer Research and Development. ,vol. 42, pp. 85- ,(2005) , 10.1360/CRAD20050112
Ron Bekkerman, Ran El-Yaniv, Naftali Tishby, Yoad Winter, On feature distributional clustering for text categorization international acm sigir conference on research and development in information retrieval. pp. 146- 153 ,(2001) , 10.1145/383952.383976
Yiming Yang, Expert network: effective and efficient learning from human decisions in text categorization and retrieval international acm sigir conference on research and development in information retrieval. pp. 13- 22 ,(1994) , 10.5555/188490.188496
Hua-Jun Zeng, Zheng Chen, Wei-Ying Ma, A unified framework for clustering heterogeneous Web objects web information systems engineering. pp. 161- 172 ,(2002) , 10.1109/WISE.2002.1181653
R.S. Parpinelli, H.S. Lopes, A.A. Freitas, Data mining with an ant colony optimization algorithm IEEE Transactions on Evolutionary Computation. ,vol. 6, pp. 321- 332 ,(2002) , 10.1109/TEVC.2002.802452