AMIDA/Klewel Mini-Project

Authors: Petr Motlicek, Philip N. Garner, Vincent Bozzo, Maël Guillemot

DOI:

Keywords:

Abstract: The goal of the AMIDA mini-project is to transfer some of the technologies developed within the AMIDA project for use in a Klewel retrieval system. More specifically, the main focus is to develop a speech-to-text application, based on an Automatic Speech Recognition (ASR) system, which could potentially be implemented in their conference webcasting system. First, this document describes the experimental setup and the results achieved in work devoted to the automatic processing of real lecture recordings provided by Klewel. Then the demonstrator, an application created for demonstrating the results, is described.

References (20)
Martin Karafiát, Jan Cernocký, Lukás Burget, Thomas Hain. Application of CMLLR in narrow band wide band adapted systems. Conference of the International Speech Communication Association, pp. 282-285 (2007).
Susanne Burger, Hua Yu, Victoria MacLaren. The ISL meeting corpus: the impact of meeting type on speech style. Conference of the International Speech Communication Association (2002).
Alessandro Vinciarelli, Hervé Bourlard, Artem Peregoudov. Towards using slide information to enhance speech transcription of meetings. IDIAP (2006).
Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Martin Karafiat, Mike Lincoln, Jithendra Vepa, Vincent Wan. The AMI Meeting Transcription System: Progress and Performance. Machine Learning for Multimodal Interaction, pp. 419-431 (2006). doi:10.1007/11965152_37
Elham Tabassi, Christophe D. Laprun, John S. Garofolo, Vincent M. Stanford, Martial Michel. The NIST Meeting Room Pilot Corpus. Language Resources and Evaluation (2004).
W. Kraaij, M. Kronenthal, A. Lisowska, I. McCowan, V. Karaiskos, J. Carletta, P. Wellner, S. Bourban, T. Hain, S. Ashby, G. Lathoud, W. Post, M. Lincoln, J. Kadlec, M. Guillemot, Dennis Reidsma, M. Flynn. The AMI meeting corpus. Symposium on Annotating and Measuring Meeting Behavior, pp. 137-140 (2005).
Mark Ordowski, Mark A. Przybocki, Alvin F. Martin, George R. Doddington, Terri Kamm. The DET Curve in Assessment of Detection Task Performance. Conference of the International Speech Communication Association (1997).
A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, C. Wooters. The ICSI Meeting Corpus. International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 364-367 (2003). doi:10.1109/ICASSP.2003.1198793
Martin Karafiat, Danil Korchagin, Philip N. Garner, Vincent Wan, Thomas Hain, John Dines, Mike Lincoln, Le Zhang, Asmaa El Hannani. Real-Time ASR from Meetings. Conference of the International Speech Communication Association, pp. 2119-2122 (2009).
Cha Zhang, Yong Rui, Jim Crawford, Li-Wei He. An automated end-to-end lecture capture and broadcasting system. ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 4, pp. 1-23 (2008). doi:10.1145/1324287.1324293