NMiner: A System for Finding Related Entities by Mining a Bimodal Network

作者: VenkataSwamy Martha , Stephen Wallace , Halil Bisgin , Xiaowei Xu , Nitin Agarwal

DOI: 10.1007/978-3-642-35341-3_30

关键词:

摘要: Motivated from related entity finding problem, in this paper, we introduce a novel approach to query answering called “NMiner.” NMiner takes advantage of heuristics find answers complex semantic queries. It uses combination natural language processing techniques parse sentences and extract entities, hypertext structure the documents derive relational information, web data relevant entities as search result candidates. Further, bimodal network is created Content Centric Ranking (CCR) Cumulative Structural Similarity (CSS), are proposed rank candidate entities. Our empirical study on ClueWeb09 corpus (with approximately 25 terabytes documents) shows that both CSS CCR outperform PageRank HITS. Moreover, proved be significant solving problem queries performed against largely unstructured text documents.

参考文章(26)
Frank Emmert Streib, Jürgen Kilian, Alexander Mehler, Matthias Dehmer, Measuring the Structural Similarity of Web-based Documents: A Novel Approach World Academy of Science, Engineering and Technology, International Journal of Computer, Electrical, Automation, Control and Information Engineering. ,vol. 1, pp. 3057- 3063 ,(2007)
Michele Minno, Davide Palmisano, Michele Mostarda, Slicing linked data by extracting significant, self-describing subsets: the DBpedia case international conference on web engineering. pp. 223- 231 ,(2010) , 10.1007/978-3-642-16985-4_20
Roberto Mirizzi, Azzurra Ragone, Tommaso Di Noia, Eugenio Di Sciascio, Ranking the Linked Data: The Case of DBpedia Lecture Notes in Computer Science. pp. 337- 354 ,(2010) , 10.1007/978-3-642-13911-6_23
A. Goker, T. L. McCluskey, Towards an Adaptive Information Retrieval System international syposium on methodologies for intelligent systems. pp. 348- 357 ,(1991) , 10.1007/3-540-54563-8_98
Alexander Mehler, Matthias Dehmer, Rüdiger Gleim, Towards logical hypertext structure IICS'04 Proceedings of the 4th international conference on Innovative Internet Community Systems. pp. 136- 150 ,(2004) , 10.1007/11553762_14
Jon M. Kleinberg, Authoritative sources in a hyperlinked environment symposium on discrete algorithms. pp. 668- 677 ,(1998) , 10.5555/314613.315045
Gabriella Kazai, Antoine Doucet, Overview of the INEX 2007 Book Search track: BookSearch '07 international acm sigir conference on research and development in information retrieval. ,vol. 42, pp. 2- 15 ,(2008) , 10.1145/1394251.1394253
David Nadeau, Satoshi Sekine, A survey of named entity recognition and classification Lingvisticae Investigationes. ,vol. 30, pp. 3- 26 ,(2007) , 10.1075/LI.30.1.03NAD
Sergey Brin, Lawrence Page, The anatomy of a large-scale hypertextual Web search engine the web conference. ,vol. 30, pp. 107- 117 ,(1998) , 10.1016/S0169-7552(98)00110-X