Empathetic BERT2BERT Conversational Model: Learning Arabic Language Generation with Little Data

作者: Hazem Hajj , Reem A. Mahmoud , Wissam Antoun , Tarek Naous

DOI:

关键词: EmpathyArabicBLEUTransformer (machine learning model)PerplexityLeverage (statistics)Natural language processingKnowledge transferNatural language generationLanguage modelNatural language understandingComputer scienceArtificial intelligence

摘要: Enabling empathetic behavior in Arabic dialogue agents is an important aspect of building human-like conversational models. While Natural Language Processing has seen significant advances Understanding (NLU) with language models such as AraBERT, Generation (NLG) remains a challenge. The shortcomings NLG encoder-decoder are primarily due to the lack datasets suitable train agents. To overcome this issue, we propose transformer-based initialized AraBERT parameters. By initializing weights encoder and decoder pre-trained weights, our model was able leverage knowledge transfer boost performance response generation. enable empathy model, it using ArabicEmpatheticDialogues dataset achieve high Specifically, achieved low perplexity value 17.0 increase 5 BLEU points compared previous state-of-the-art model. Also, proposed rated highly by 85 human evaluators, validating its capability exhibiting while generating relevant fluent responses open-domain settings.

参考文章(29)
W. Gerrod Parrott, Emotions in social psychology : essential readings Published in <b>2001</b> in Philadelphia Pa) by Psychology press. ,(2001)
Mohammad Hijjawi, Zuhair Bandar, Keeley Crockett, David Mclean, ArabChat: An Arabic Conversational Agent international conference on computer science and information technology. pp. 227- 237 ,(2014) , 10.1109/CSIT.2014.6806005
Ahmed Abdelali, Kareem Darwish, Nadir Durrani, Hamdy Mubarak, Farasa: A Fast and Furious Segmenter for Arabic north american chapter of the association for computational linguistics. pp. 11- 16 ,(2016) , 10.18653/V1/N16-3003
Nizar Habash, Dana Abu Ali, Botta: An Arabic Dialect Chatbot international conference on computational linguistics. pp. 208- 212 ,(2016)
Douwe Kiela, Arthur Szlam, Jason Weston, Saizheng Zhang, Jack Urbanek, Emily Dinan, Personalizing Dialogue Agents: I have a dog, do you have pets too? arXiv: Artificial Intelligence. ,(2018)
Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler, None, Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books arXiv: Computer Vision and Pattern Recognition. ,(2015)
Daphne Ippolito, Reno Kriz, Joao Sedoc, Maria Kustikova, Chris Callison-Burch, Comparison of Diverse Decoding Methods from Conditional Language Models meeting of the association for computational linguistics. pp. 3752- 3762 ,(2019) , 10.18653/V1/P19-1365
Özge Nilay Yalçın, Steve DiPaola, M-Path: A Conversational System for the Empathic Virtual Agent Advances in Intelligent Systems and Computing. pp. 597- 607 ,(2020) , 10.1007/978-3-030-25719-4_78
Angela Fan, Mike Lewis, Yann Dauphin, Hierarchical Neural Story Generation meeting of the association for computational linguistics. ,vol. 1, pp. 889- 898 ,(2018) , 10.18653/V1/P18-1082
Guillaume Klein, Yoon Kim, Yuntian Deng, Jean Senellart, Alexander Rush, OpenNMT: Open-Source Toolkit for Neural Machine Translation meeting of the association for computational linguistics. pp. 67- 72 ,(2017) , 10.18653/V1/P17-4012