A Trend Discovery System for Dynamic Web Content Mining

Authors: A. Méndez-Torreblanca, A. López-López, Luís Enrique Erro, M. Montes-y-Gómez

Abstract: The rapid expansion of the web is causing constant growth of information, leading to several problems such as an increased difficulty in extracting potentially useful knowledge. Web content mining confronts this problem by gathering explicit information from different web sites for its access and knowledge discovery. Its current methods focus on analyzing static web sites and cannot deal with constantly changing sites, such as news sites. In this paper, we propose a method for dynamic web content mining of online news sites. This method applies dynamic schemes for exploring these sites and extracting news reports, and uses domain-independent statistical analysis for trend analysis. The overall method is an application of web content mining that goes beyond straightforward news analysis, trying to understand society's interests and to measure the social importance of ongoing events.

References (10)
Alexander Gelbukh, Grigori Sidorov, Adolfo Guzmán-Arenas. Use of a Weighted Topic Hierarchy for Document Classification. Text, Speech and Dialogue, pp. 133-138 (1999). doi:10.1007/3-540-48239-3_24
Alexander Gelbukh, Aurelio López López, Manuel Montes y Gómez. Mining the News: Trends, Associations, and Deviations. Computación y Sistemas, vol. 5, pp. 14-24 (2001). doi:10.13053/CYS-5-1-965
Lisa Singh, Bin Chen, Rebecca Haight, Peter Scheuermann. An Algorithm for Constrained Association Rule Mining in Semi-structured Data. Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 148-158 (1999). doi:10.1007/3-540-48912-6_21
Raymond Kosala, Hendrik Blockeel. Web mining research: a survey. SIGKDD Explorations, vol. 2, pp. 1-15 (2000). doi:10.1145/360402.360406
L.S. Gay, W.B. Croft. Interpreting nominal compounds for information retrieval. Information Processing and Management, vol. 26, pp. 21-38 (1990). doi:10.1016/0306-4573(90)90007-O
F. Crimmins, A.F. Smeaton, T. Dkaki, J. Mothe. TetraFusion: information discovery on the Internet. IEEE Intelligent Systems & Their Applications, vol. 14, pp. 55-62 (1999). doi:10.1109/5254.784085
N. Kushmerick. Gleaning the Web. IEEE Intelligent Systems & Their Applications, vol. 14, pp. 20-22 (1999). doi:10.1109/5254.757626
Oren Etzioni. The World-Wide Web. Communications of the ACM, vol. 39, pp. 65-68 (1996). doi:10.1145/240455.240473
Stephen Soderland. Learning Information Extraction Rules for Semi-Structured and Free Text. Machine Learning, vol. 34, pp. 233-272 (1999). doi:10.1023/A:1007562322031