Author: Yunxin Zhao
DOI: 10.1121/1.426675
Keywords: Computer science, Calibration (statistics), Adaptation (computer science), Phone, Dependency (UML), Speaker recognition, Speaker diarisation, Normalization (statistics), Speech recognition, Variation (linguistics), Acoustics
Abstract: A speaker adaptation technique based on the separation of speech spectra variation sources is developed for improving speaker-independent continuous speech recognition. The variation sources include acoustic characteristics and the contextual dependency of allophones. Statistical methods are formulated to normalize acoustic characteristics and then adapt mixture Gaussian density phone models to phonologic characteristics. Adaptation experiments using a short calibration (5 sec./speaker) have shown substantial performance improvement over the baseline recognition system.