A unified view for discriminative objective functions based on negative exponential of difference measure between strings

作者: Atsushi Nakamura , Erik McDermott , Shinji Watanabe , Shigeru Katagiri

DOI: 10.1109/ICASSP.2009.4959913

关键词: Component (UML)Function (mathematics)Artificial intelligenceMathematicsDiscriminative modelPattern recognitionMutual informationMeasure (mathematics)Joint probability distributionPattern recognition (psychology)Weighting

摘要: This paper presents a novel unified view of wide variety objective functions suitable for discriminative training applied to sequential pattern recognition problems, such as automatic speech recognition. Focusing on central component conventional functions, the sum modified joint probabilities observations and strings, analysis generalizes these by weighting each term in an important function, negative exponential difference measure between strings. The interesting valuable results this investigation are highlighted comprehensive relationship chart that covers all common approaches (Maximum Mutual Information, Minimum Classification Error, Phone/Word Error), well corresponding generalizations modifications those approaches.

参考文章(9)
Andreas Stolcke, Jing Zheng, Improved discriminative training using phone lattices. conference of the international speech communication association. pp. 2125- 2128 ,(2005)
George Saon, Daniel Povey, Penalty function maximization for large margin HMM training. conference of the international speech communication association. pp. 920- 923 ,(2008)
Erik McDermott, Atsushi Nakamura, Flexible discriminative training based on equal error group scores obtained from an error-indexed forward-backward algorithm. conference of the international speech communication association. pp. 2398- 2401 ,(2008)
Xiaodong He, Li Deng, Wu Chou, Discriminative learning in sequential pattern recognition IEEE Signal Processing Magazine. ,vol. 25, pp. 14- 36 ,(2008) , 10.1109/MSP.2008.926652
Georg Heigold, Thomas Deselaers, Ralf Schlüter, Hermann Ney, Modified MMI/MPE Proceedings of the 25th international conference on Machine learning - ICML '08. pp. 384- 391 ,(2008) , 10.1145/1390156.1390205
Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Karthik Visweswariah, Boosted MMI for model and feature-space discriminative training international conference on acoustics, speech, and signal processing. pp. 4057- 4060 ,(2008) , 10.1109/ICASSP.2008.4518545
D. Povey, P.C. Woodland, Minimum Phone Error and I-smoothing for improved discriminative training international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 105- 108 ,(2002) , 10.1109/ICASSP.2002.5743665
Ralf Schlüter, Hermann Ney, Lars Haferkamp, Wolfgang Macherey, Investigations on Error Minimizing Training Criteria for Discriminative Training in Automatic Speech Recognition conference of the international speech communication association. pp. 2133- 2136 ,(2005)
Erik McDermott, Timothy J. Hazen, Jonathan Le Roux, Atsushi Nakamura, Shigeru Katagiri, Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 15, pp. 203- 223 ,(2007) , 10.1109/TASL.2006.876778