The ISL RT-06S Speech-to-Text System

作者: Christian Fügen , Shajith Ikbal , Florian Kraft , Kenichi Kumatani , Kornel Laskowski

DOI: 10.1007/11965152_36

关键词:

摘要: This paper describes the 2006 lecture and conference meeting speech-to-text system developed at Interactive Systems Laboratories (ISL), for individual head-mounted microphone (IHM), single distant (SDM), multiple (MDM) conditions, which was evaluated in RT-06S Rich Transcription Meeting Evaluation sponsored by US National Institute of Standards Technologies (NIST). We describe principal differences between our current those submitted previous years, namely improved acoustic language models, cross adaptation systems with different front-ends phoneme sets, use various automatic speech segmentation algorithms.

参考文章(37)
Sebastian Stüker, Christian Fügen, Matthias Wölfel, Florian Kraft, The ISL 2007 English Speech Transcription System for European Parliament Speeches conference of the international speech communication association. pp. 2609- 2612 ,(2007)
Qin Jin, Shajith Ikbal, Florian Kraft, Yik-Cheung Tam, Roger Hsiao, Matthias W ¨ olfel, Christian F ¨ ugen, Martin Raab, Matthias Paulik, The ISL TC-STAR Spring 2006 ASR Evaluation Systems ,(2006)
Sebastian Stüker, Mohamed Noamany, Qin Jin, Tanja Schultz, Yik-Cheung Tam, Hua Yu, Thomas Schaaf, The ISL RT04 Mandarin Broadcast News Evaluation System EARS Rich Transcription Workshop, New York, NY, 10. Nov. 2004. ,(2004)
Andreas Stolcke, Kofi Boakye, Improved speech activity detection using cross-channel features for recognition of multiparty meetings. conference of the international speech communication association. ,(2006)
Alex Waibe11, Hartwig Steusloff, Rainer Stiefelhagen, None, CHIL - Computers in the Human Interaction Loop. Journal of Machine Vision and Applications. pp. 18- 18 ,(2005)
Matthias Wölfel, John W. McDonough, Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination conference of the international speech communication association. pp. 3149- 3152 ,(2005)
Sebastian Stüker, Christian Fügen, Matthias Wölfel, Susanne Burger, Cross-System Adaptation and Combination for Continuous Speech Recognition: The Influence of Phoneme Set and Acoustic Front-End conference of the international speech communication association. ,(2006)
Christian Fügen, Matthias Wölfel, Shajith Ikbal, John W. McDonough, Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures conference of the international speech communication association. ,(2006)
Andreas Stolcke, Lidia Mangu, Eric Brill, Finding consensus among words : Lattice-based word error minimization conference of the international speech communication association. ,(1999)