Method and apparatus for processing scripts

作者: David A. Kuspa , Walter W. Chang , R. Scoggins Ii Jerry

DOI:

关键词: Set (abstract data type)Scripting languageNatural language processingSpeech recognitionComputer scienceMatching (statistics)Artificial intelligence

摘要: Provided in some embodiments is a computer implemented method that includes providing script data including words indicative of dialogue to be spoken, recorded audio corresponding at least portion the wherein timecodes associated with words, matching determine alignment points, determining set unmatched are accurate based on matched generating time-aligned and their determined words.

参考文章(39)
Oren Glickman, Jean-Manuel Van Thong, Pedro J. Moreno, Christopher F. Joerg, A recursive algorithm for the forced alignment of very long audio segments. conference of the international speech communication association. ,(1998)
Timothy J. Hazen, Automatic Alignment and Error Correction of Human Generated Transcripts for Long Speech Recordings conference of the international speech communication association. ,(2006)
Jian-Iai Zhou, Dongmei Zhang, Peng Liu, Frank Kao-Ping Soong, Minimum divergence based discriminative training for pattern recognition ,(2007)
Jean-Manuel Van Thong, Pedro Moreno, Method for refining time alignments of closed captions ,(1999)
Cyril Goutte, Michel Simard, Arne Mauser, Kenji Yamada, Eric Gaussier, Apparatus and methods for aligning words in bilingual sentences ,(2005)
Ariff Sidi, Jason R. Grant, Christopher White, Skarphedinn Hedinsson, Yii Lih Liu, David Watson, Jonathan Barsook, System and method for real-time media presentation using metadata clips ,(2009)
Arthur Keller, Kerry A. Ortega, Carmi Gazit, Waltraud Brunner, Thomas Netousek, Antonio R. Lee, Brian S. Brooks, Off site voice enrollment on a transcription device for speech recognition ,(1999)
Enrique Ruiz-Velasco, Shafiq Kassam, Shahzaib Zafar, Media content distribution systems and methods ,(2009)