作者: Georg Heigold , Ignacio Moreno , Samy Bengio , Noam Shazeer
DOI: 10.1109/ICASSP.2016.7472652
关键词:
摘要: In this paper we present a data-driven, integrated approach to speaker verification, which maps test utterance and few reference utterances directly single score for verification jointly optimizes the system's components using same evaluation protocol metric as at time. Such an will result in simple efficient systems, requiring little domain-specific knowledge making model assumptions. We implement idea by formulating problem neural network architecture, including estimation of on only utterances, evaluate it our internal "Ok Google" benchmark text-dependent verification. The proposed appears be very effective big data applications Like ours that require highly accurate, easy-to-maintain systems with small footprint.