作者: Francois Mairesse , Filip Jurcicek , Milica Gasic , Blaise Thomson , Steve Young
DOI:
关键词: Artificial intelligence 、 Set (abstract data type) 、 Generator (mathematics) 、 Rank (computer programming) 、 Statistical model 、 Dynamic Bayesian network 、 Natural language processing 、 Active learning (machine learning) 、 Graphical model 、 Active learning 、 Machine learning 、 Phrase 、 Computer science
摘要: Most previous work on trainable language generation has focused two paradigms: (a) using a statistical model to rank set of generated utterances, or (b) statistics inform the decision process. Both approaches rely existence handcrafted generator, which limits their scalability new domains. This paper presents Bagel, generator uses dynamic Bayesian networks learn from semantically-aligned data produced by 42 untrained annotators. A human evaluation shows that Bagel can generate natural and informative utterances unseen inputs in information presentation domain. Additionally, performance sparse datasets is improved significantly certainty-based active learning, yielding ratings close gold standard with fraction data.