Logic2Text: High-Fidelity Natural Language Generation from Logical Forms

Authors: William Yang Wang, Sairam Sundaresan, Wenhu Chen, Hanwen Zha, Zhiyu Chen

DOI:

Keywords:

Abstract: Previous works on Natural Language Generation (NLG) from structured data have primarily focused on surface-level descriptions of record sequences. However, for complex structured data, e.g., multi-row tables, it is often desirable for an NLG system to describe interesting facts from logical inferences across records. If only provided with the table, it is hard for existing models to produce controllable and high-fidelity logical generations. In this work, we formulate logical-level NLG as generation from logical forms in order to obtain controllable, high-fidelity, and faithful generations. We present a new large-scale dataset, Logic2Text, with 10,753 descriptions involving common logic types paired with the underlying logical forms. The logical forms show diversified graph structure of free schema, which poses great challenges on the model's ability to understand the semantics. We experiment on (1) fully-supervised training with the full datasets, and (2) a few-shot setting, provided with hundreds of paired examples; we compare several popular generation models and analyze their performances. We hope our dataset can encourage research towards building an advanced NLG system capable of natural, faithful, and human-like generation. The dataset and code are available at https URL.
