A fast approach to identify trending articles in hot topics from XML based big bibliographic datasets

作者: K. P. Swaraj , D. Manjula

DOI: 10.1007/S10586-016-0561-1

关键词:

摘要: Nowadays XML based big bibliographic datasets are common in different domains which provide meta data about articles published that domain. They have well defined tags give details of the year, title, authors, abstract, keywords, type article, venue publishing article and other such specific each article. A lot statistics can be extracted from this dataset. Most time tag pertaining to domain sub topic information associated with will absent dataset as it is not an attribute. Hence for must mapped its This paper investigates problem proposes a fast approach find trending hot topics datasets. The proposed framework uses ontology first classify into topics. Fast detection topics, keywords achieved using novel Map Reduce algorithms implemented on hadoop distributed framework. Performance comparison demonstrates outperforms non-Map counterpart quickly sorting out titles particular

参考文章(27)
Gridaphat Sriharee, Ravikarn Punnarut, A researcher expertise search system using ontology-based data mining asia pacific conference on conceptual modelling. pp. 71- 78 ,(2010)
Michael Ley, The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives string processing and information retrieval. pp. 1- 10 ,(2002) , 10.1007/3-540-45735-6_1
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Saleh Alwahaishi, Jan Martinovič, Václav Snášel, Analysis of the DBLP Publication Classification Using Concept Lattices digital enterprise and information systems. pp. 99- 108 ,(2011) , 10.1007/978-3-642-22603-8_10
Kumar Shubhankar, Aditya Pratap Singh, Vikram Pudi, An Efficient Algorithm for Topic Ranking and Modeling Topic Evolution Lecture Notes in Computer Science. pp. 320- 330 ,(2011) , 10.1007/978-3-642-23088-2_23
Rongxin Chen, Husheng Liao, ParaParse: A parallel method for XML parsing ieee international conference on communication software and networks. pp. 81- 85 ,(2011) , 10.1109/ICCSN.2011.6014223
Gamila Obadi, Pavla Drazdilova, Lukas Hlavacek, Jan Martinovic, Vaclav Snasel, A Tolerance Rough Set Based Overlapping Clustering for the DBLP Data web intelligence. ,vol. 3, pp. 57- 60 ,(2010) , 10.1109/WI-IAT.2010.286
T. L. Griffiths, M. Steyvers, Finding scientific topics Proceedings of the National Academy of Sciences of the United States of America. ,vol. 101, pp. 5228- 5235 ,(2004) , 10.1073/PNAS.0307752101
Zheng Fen, Xu Yabin, Li Yanping, Research on Internet Hot Topic Detection Based on MapReduce Architecture international conference on intelligent human-machine systems and cybernetics. ,vol. 1, pp. 81- 84 ,(2012) , 10.1109/IHMSC.2012.26