Frugal method and system for creating speech corpus

作者: Sunil Kumar Kopparapu , Imran Ahmed Sheikh

DOI:

关键词:

摘要: The present invention provides a frugal method for extraction of speech data and associated transcription from plurality web resources (internet) corpus creation characterized by an automation the cost reduction. An integration existing with extracted its to build aggregated rich that are effective easy adapt generating acoustic language models (Automatic Speech Recognition) ASR systems.

参考文章(8)
Jeff Madison, Jeffrey L. Brimhall, Computerized credibility scoring ,(2008)
Frederic Bechet, Dilek Hakkani-Tur, Jeremy Wright, Allen Gorin, Method and system for creating a named entity language model ,(2003)
Bradley C. Lackey, Method of recognizing phones in speech of any language Journal of the Acoustical Society of America. ,vol. 125, pp. 588- ,(2004) , 10.1121/1.3074496
Michael Finke, Juergen Fritsch, Koll Detlef, Automatic detection and application of editing patterns in draft documents ,(2006)
Patrick Düssel, Konrad Rieck, Klaus-Robert Müller, Pavel Laskov, A method and apparatus for automatic comparison of data sequences ,(2006)