Cross-lingual Emotion Intensity Prediction

作者: Toni Badia , Jeremy Barnes , Irean Navas Alejo

DOI:

关键词:

摘要: Emotion intensity prediction determines the degree or of an emotion that author expresses in a text, extending previous categorical approaches to detection. While most work on this topic has concentrated English texts, other languages would also benefit from fine-grained classification, preferably without having recreate amount annotated data available each new language. Consequently, we explore cross-lingual transfer for detection Spanish and Catalan tweets. To end annotate test set tweets using Best-Worst scaling. We compare six approaches, e.g., machine translation embeddings, which have varying requirements parallel – millions sentences completely unsupervised. The results show data, methods with low parallel-data perform surprisingly better than use more explain through in-depth error analysis. make dataset code at https://github.com/jerbarnes/fine-grained_cross-lingual_emotion.

参考文章(39)
Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova, BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding north american chapter of the association for computational linguistics. pp. 4171- 4186 ,(2018) , 10.18653/V1/N19-1423
Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, Illia Polosukhin, None, Attention is All You Need neural information processing systems. ,vol. 30, pp. 5998- 6008 ,(2017)
Xilun Chen, Yu Sun, Ben Athiwaratkun, Claire Cardie, Kilian Weinberger, Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification Transactions of the Association for Computational Linguistics. ,vol. 6, pp. 557- 570 ,(2018) , 10.1162/TACL_A_00039
Jeremy Barnes, Roman Klinger, Sabine Schulte im Walde, Bilingual Sentiment Embeddings: Joint Projection of Sentiment Across Languages Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). ,vol. 1, pp. 2483- 2493 ,(2018) , 10.18653/V1/P18-1231
Jeremy Barnes, Roman Klinger, Sabine Schulte im Walde, Assessing State-of-the-Art Sentiment Models on State-of-the-Art Sentiment Datasets empirical methods in natural language processing. pp. 2- 12 ,(2017) , 10.18653/V1/W17-5202
Svetlana Kiritchenko, Saif M. Mohammad, Capturing Reliable Fine-Grained Sentiment Associations by Crowdsourcing and Best–Worst Scaling north american chapter of the association for computational linguistics. pp. 811- 817 ,(2016) , 10.18653/V1/N16-1095
Shijie Wu, Mark Dredze, Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). pp. 833- 844 ,(2019) , 10.18653/V1/D19-1077
Jeremy Barnes, Roman Klinger, Embedding Projection for Targeted Cross-lingual Sentiment: Model Comparisons and a Real-World Study Journal of Artificial Intelligence Research. ,vol. 66, pp. 691- 742 ,(2019) , 10.1613/JAIR.1.11561
Bjarke Felbo, Alan Mislove, Anders Søgaard, Iyad Rahwan, Sune Lehmann, Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm empirical methods in natural language processing. pp. 1615- 1625 ,(2017) , 10.18653/V1/D17-1169
Saima Aman, Stan Szpakowicz, Identifying expressions of emotion in text text speech and dialogue. pp. 196- 205 ,(2007) , 10.1007/978-3-540-74628-7_27