Multiple recognizer speech recognition

作者: Fadi Biadsy , Pedro J. Moreno Mengibar , Petar Aleksic

DOI:

关键词: Natural language processingLanguage modelUtteranceEmbodied cognitionArtificial intelligenceSubject matterLimited speechTranscription (software)GrammarComputer scienceSpeech recognitionVocabulary

摘要: The invention relates to multiple recognizer speech recognition. subject matter of this specification can be embodied in, among other things, a method that includes receiving audio data corresponds an utterance, obtaining first transcription the utterance was generated using limited recognizer. includesa language model is trained over recognition vocabulary one or more terms from voice command grammar, but fewer than all expanded grammar. A second obtained classified based at least on portion transcription.

参考文章(37)
Adam Soroca, Neal J. Karasic, Jorey Ramer, Dennis Doughty, Mobile Communication Facility Usage Pattern Geographic Based Advertising ,(2011)
Kui Xu, Fuliang Weng, Zhe Feng, Lin Zhao, Speech recognition using multiple language models ,(2012)
David Suendermann, Krishna Dayanidhi, Roberto Pieraccini, Jackson Liscombe, System and method for building optimal state-dependent statistical utterance classifiers in spoken dialog systems ,(2009)
Pieter J. Verneulen, Todd F. Mozer, Background speech recognition assistant ,(2011)
Ram Aringunrum, Leah Pearlman, Timothy Sharpe, Duoc Nguyen, Raghuveer Simha, John S. Holmes, Eric Badger, Stacia Scott, John R. Selbie, Alexandra Heron, Ahmed Azmy Hassan, Asynchronous discrete manageable instant voice messages ,(2005)