作者: Kishor Morkhandikar , Pallaki Gururaj , Ian M. Bennett , Bandi Ramesh Babu
DOI:
关键词:
摘要: A real-time system (100) incorporating speech recognition and linguistic processing for recognizing a spoken query by user distributed between client (150) server (180), is disclosed. The accepts user's queries in the form of at where minimal extracts sufficient number acoustic vectors representing utterance. These are sent via communications channel (160A) to (180) additional derived. Using Hidden Markov Models (HMMs), appropriate grammars dictionaries conditioned selections made user, fully decoded into text (or some other suitable form) (180). corresponding then simultaneously natural language engine (190) database processor (186) optimized SQL statements constructed full-text search from (188) record set several stored questions that best matches query. Further narrows single question. answer this question next retrieved file path compressed form. At (150), articulated using text-to-speech (159) his or her native language. requires no training can operate languages.