Large scale entity-specific resource classification

作者: Sundararajan Sellamanickam , Mridul Muralidharan , Philip L. Bohannon , Ashwin Machanavajjhala , Sathiya K. Selvaraj

DOI:

关键词: Feature (machine learning)IdentifierWeak entityWeb pageHome pageRank (computer programming)Class (set theory)Set (abstract data type)Information retrievalComputer science

摘要: A system and method is described for large scale entity-specific classification of each set candidates in a collection specific entity entities. The entities may comprise category or domain (e.g. schools, restaurants, manufacturers, products, events, people). Candidates webpages other resources with resource identifiers. Entity sets be found by leveraging search engine query results user interaction therewith queries based on attributes. relationship(s) class(es) which candidate are being classified relative to an authoritative, official home page (OHP), class fan page, review, aggregator) entity. feature generator generates features candidates. In accordance its features, one more classifiers rank

参考文章(21)
Serhiy Kosinov, Marzia Polito, Carole Dulong, Igor Kozintsev, Method for personalized named entity recognition ,(2006)
Taek Nam, Seung Han, Su Choi, Chi Jeong, Apparatus and method for gathering of objectional web sites ,(2006)
Robert A. Kaplan, Identity access management system ,(2004)
Marco J. Zagha, James B. Harvey, Matvey Nemenman, Mohit Sabharwal, Charles C. Carson, Devika Chawla, Matching and ranking of sponsored search listings incorporating web search technology and web content ,(2006)
Wessel Kraaij, Thijs Westerveld, Djoerd Hiemstra, The Importance of Prior Probabilities for Entry Page Search Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02. pp. 27- 34 ,(2002) , 10.1145/564376.564383
Amit Singhal, Marcin Kaszkiel, A case study in web search using TREC algorithms Proceedings of the tenth international conference on World Wide Web - WWW '01. pp. 708- 716 ,(2001) , 10.1145/371920.372186
Huaiyu Zhu, Sriram Raghavan, Shivakumar Vaithyanathan, Alexander Löser, Navigating the intranet with high precision the web conference. pp. 491- 500 ,(2007) , 10.1145/1242572.1242639