Generating Natural Language from Linked Data: Unsupervised template extraction

作者: Ewan Klein , Daniel Duma

DOI:

关键词: Linked dataInformation retrievalRDFSentenceSimple (abstract algebra)Natural language processingNatural languageArtificial intelligenceComputer scienceBaseline (configuration management)Structure (mathematical logic)Coherence (linguistics)

摘要: We propose an architecture for generating natural language from Linked Data that automatically learns sentence templates and statistical document planning parallel RDF datasets text. have built a proof-of-concept system (LOD-DEF) trained on un-annotated text the Simple English Wikipedia triples DBpedia, focusing exclusively factual, non-temporal information. The goal of is to generate short descriptions, equivalent stubs, entities found in Datasets. evaluated LOD-DEF against simple generate-from-triples baseline human-generated output. In evaluation by humans, significantly outperforms two three measures: non-redundancy structure coherence.

参考文章(13)
Michel Gagnon, Lyne Da Sylva, Text Compression by Syntactic Pruning Advances in Artificial Intelligence. pp. 312- 323 ,(2006) , 10.1007/11766247_27
Ehud Reiter, Robert Dale, Building Natural Language Generation Systems ,(2000)
Christian Bizer, Bebo White, Tom Heath, Linked Data: Evolving the Web into a Global Data Space ,(2011)
Xiantang Sun, Chris Mellish, An Experiment on "Free Generation" from Single RDF Triples natural language generation. pp. 105- 108 ,(2007) , 10.3115/1610163.1610181
Pablo A. Duboue, Kathleen R. McKeown, Statistical acquisition of content selection rules for natural language generation Proceedings of the 2003 conference on Empirical methods in natural language processing -. pp. 121- 128 ,(2003) , 10.3115/1119355.1119371
Katja Filippova, Michael Strube, Dependency tree based sentence compression international conference on natural language generation. pp. 25- 32 ,(2008) , 10.3115/1708322.1708329
Dan Klein, Christopher D. Manning, Accurate unlexicalized parsing Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - ACL '03. pp. 423- 430 ,(2003) , 10.3115/1075096.1075150
T. A. Cohn, M. Lapata, Sentence compression as tree transduction Journal of Artificial Intelligence Research. ,vol. 34, pp. 637- 674 ,(2009) , 10.1613/JAIR.2655
Rada Mihalcea, Andras Csomai, Wikify!: linking documents to encyclopedic knowledge conference on information and knowledge management. pp. 233- 242 ,(2007) , 10.1145/1321440.1321475