Authors: Uday Kamath, John Liu, James Whitaker
DOI: 10.1007/978-3-030-14596-5_8
Keywords:
Abstract: Automatic speech recognition (ASR) has grown tremendously in recent years, with deep learning playing a key role. Simply put, ASR is the task of converting spoken language into computer-readable text (Fig. 8.1). It has quickly become ubiquitous as a useful way to interact with technology, significantly bridging the gap in human–computer interaction and making it more natural.