A fast approach to identify trending articles in hot topics from XML based big bibliographic datasets

Authors: K. P. Swaraj, D. Manjula

Abstract: XML-based big bibliographic datasets are now common in many domains, where they provide metadata about the articles published in that domain. Their well-defined tags give the year, title, authors, abstract, keywords, article type, publication venue, and other details specific to each article, so many statistics can be extracted from such a dataset. Most of the time, however, a tag carrying the domain or sub-topic associated with an article is absent from the dataset, because the topic is not an attribute of the article. Each article must therefore first be mapped to its topic. This paper investigates that problem and proposes a fast approach to find trending articles in hot topics from such datasets. The proposed framework uses an ontology to first classify the articles into topics. Fast detection of hot topics, hot keywords, and trending articles is then achieved using novel MapReduce algorithms implemented on the Hadoop distributed framework. A performance comparison demonstrates that the approach outperforms its non-MapReduce counterpart in quickly sorting out the trending titles in a particular topic.
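The two stages the abstract names, ontology-based classification followed by MapReduce counting, can be illustrated with a minimal single-process sketch. This is not the authors' algorithm and does not use Hadoop. The toy `ONTOLOGY`, the sample XML, the element names (`article`, `title`, `year`), and all function names are assumptions made for illustration only.

```python
from collections import Counter
from itertools import chain
import xml.etree.ElementTree as ET

# Hypothetical toy ontology (topic -> indicative terms); the abstract does
# not describe the ontology the paper actually uses.
ONTOLOGY = {
    "databases": {"query", "sql", "index"},
    "machine learning": {"neural", "classifier", "learning"},
}

# Hypothetical sample records; a real bibliographic schema may differ.
SAMPLE_XML = """<dblp>
  <article><title>Neural classifier training</title><year>2015</year></article>
  <article><title>Fast SQL query index</title><year>2015</year></article>
  <article><title>Deep learning with neural ranking</title><year>2014</year></article>
  <article><title>Learning a neural index</title><year>2015</year></article>
</dblp>"""


def classify(title):
    """Return the topic whose term set overlaps the title most, or None."""
    words = set(title.lower().split())
    best = max(ONTOLOGY, key=lambda topic: len(words & ONTOLOGY[topic]))
    return best if words & ONTOLOGY[best] else None


def map_article(article):
    """Map step: emit ((topic, year), 1) for one <article> element."""
    topic = classify(article.findtext("title", default=""))
    if topic is not None:
        yield (topic, article.findtext("year")), 1


def reduce_counts(pairs):
    """Reduce step: sum the emitted values for each (topic, year) key."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return totals


def hot_topics(xml_text):
    """Run the map step over every article, then reduce to per-key counts."""
    root = ET.fromstring(xml_text)
    pairs = chain.from_iterable(map_article(a) for a in root.iter("article"))
    return reduce_counts(pairs)


counts = hot_topics(SAMPLE_XML)
```

Calling `counts.most_common()` ranks the (topic, year) pairs by article count. In a distributed setting the map and reduce functions would run on separate nodes, with the framework grouping the emitted pairs by key between the two steps.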



Similar articles (6)

Computer-supported portfolio analysis and comparison using ontology-based patent classification mapping scheme: the case of mobile communication patent pools

XML Data Analysis: Recent Review in Scope of Association Rule Generation

Keyword Based Identification of Thrust Area Using MapReduce for Knowledge Discovery

Using hybrid algorithmic-crowdsourcing methods for academic knowledge acquisition

Research on multi-feature fusion algorithm for subject words extraction and summary generation of text

Research on topic discovery technology for Web news
