作者: Luisa Bentivogli* , Beatrice Savoldi* , Matteo Negri , Mattia Antonino Di Gangi , Roldano Cattoni
DOI: 10.18653/V1/2020.ACL-MAIN.619
关键词: Benchmark (computing) 、 Natural language processing 、 Audio signal 、 Computer science 、 Sentence 、 Gender identity 、 Artificial intelligence 、 Speech translation 、 Grammatical gender 、 Machine translation 、 Natural language
摘要: Translating from languages without productive grammatical gender like English into gender-marked is a well-known difficulty for machines. This also due to the fact that training data on which models are built typically reflect asymmetries of natural languages, bias included. Exclusively fed with textual data, machine translation intrinsically constrained by input sentence does not always contain clues about identity referred human entities. But what happens speech translation, where an audio signal? Can provide additional information reduce bias? We present first thorough investigation in contributing with: i) release benchmark useful future studies, and ii) comparison different technologies (cascade end-to-end) two language directions (English-Italian/French).