Human computing and crowdsourcing methods for knowledge acquisition

作者: Sarath Kumar Kondreddi

DOI: 10.22028/D291-26564

关键词: Knowledge acquisitionCrowdsourcingLanguage modelUnstructured dataInformation extractionKnowledge extractionNatural languageInformation retrievalData scienceWordNetComputer science

摘要: Ambiguity, complexity, and diversity in natural language textual expressions are major hindrances to automated knowledge extraction. As a result state-of-the-art methods for extracting entities relationships from unstructured data make incorrect extractions or produce noise. With the advent of human computing, computationally hard tasks have been addressed through inputs. While textbased acquisition can benefit this approach, humans alone cannot bear burden vast resources that exist today. Even making payments crowdsourced quickly become prohibitively expensive. In thesis we present principled effectively garner computing inputs improving extraction knowledge-base facts texts. Our complement automatic techniques with reap benefits both while overcoming each other’s limitations. We architecture implementation HIGGINS , system combines an information (IE) engine (HC) high quality facts. Using methods, IE compiles dictionaries entity names relational phrases. It further statistics derived large Web corpora semantic like WordNet ConceptNet expand dictionary employs specifically designed statistical models phrase relatedness come up questions relevant candidate answers presented workers. Through extensive experiments establish superiority approach relation-centric text. our extract about fictitious characters narrative text, where issues complexity expressing relations far more pronounced. Finally, also demonstrate how interesting games be tasks.

参考文章(83)
Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, Zachary Ives, Sören Auer, Christian Bizer, DBpedia: a nucleus for a web of open data international semantic web conference. ,vol. 4825, pp. 722- 735 ,(2007) , 10.1007/978-3-540-76298-0_52
Cristina Sarasua, Elena Simperl, Natalya F Noy, None, CrowdMap: crowdsourcing ontology alignment with microtasks international semantic web conference. pp. 525- 541 ,(2012) , 10.1007/978-3-642-35176-1_33
Panagiotis G. Ipeirotis, Praveen K. Paritosh, Managing Crowdsourced Human Computation ,(2011)
Fabian Suchanek, Gerhard Weikum, Ndapandula Nakashole, PATTY: A Taxonomy of Relational Patterns with Semantic Types empirical methods in natural language processing. pp. 1135- 1145 ,(2012)
Sergey Brin, Extracting Patterns and Relations from the World Wide Web Lecture Notes in Computer Science. pp. 172- 183 ,(1999) , 10.1007/10704656_11
Michael J. Cafarella, Oren Etzioni, Stephen Soderland, Michele Banko, Matt Broadhead, Open information extraction from the web international joint conference on artificial intelligence. pp. 2670- 2676 ,(2007)
Omar Alonso, Ricardo Baeza-Yates, Design and Implementation of Relevance Assessments Using Crowdsourcing Lecture Notes in Computer Science. pp. 153- 164 ,(2011) , 10.1007/978-3-642-20161-5_16
Carl Vondrick, Donald Patterson, Deva Ramanan, Efficiently Scaling up Crowdsourced Video Annotation International Journal of Computer Vision. ,vol. 101, pp. 184- 204 ,(2013) , 10.1007/S11263-012-0564-1
Bill MacCartney, Marie-Catherine de Marneffe, Christopher D. Manning, Generating Typed Dependency Parses from Phrase Structure Parses language resources and evaluation. pp. 449- 454 ,(2006)