Automatic prosodic event detection using a novel labeling and selection method in co-training

作者: Je Hun Jeon , Yang Liu

DOI: 10.1016/J.SPECOM.2011.10.008

关键词: Event (computing)Supervised learningPitch accentComputer scienceArtificial intelligenceWord error ratePattern recognitionLearning curvePhraseCo-trainingSet (abstract data type)Natural language processing

摘要: Most previous approaches to automatic prosodic event detection are based on supervised learning, relying the availability of a corpus that is annotated with labels interest in order train classification models. However, creating such resources an expensive and time-consuming task. In this paper, we exploit semi-supervised learning co-training algorithm for coarse-level representation events as pitch accent, intonational phrase boundaries, break indices. Since works condition views compatible uncorrelated, real data often do not satisfy these conditions, propose method label select examples co-training. our experiments Boston University radio news corpus, when using only small amount labeled initial training set, proposed labeling can effectively use unlabeled improve performance finally reach close results more data. We perform thorough analysis various factors impacting curves, including error rate informativeness added examples, individual classifiers their difference, size.

参考文章(41)
Shasha Xie, Yang Liu, Hui Lin, Semi-supervised extractive speech summarization via co-training algorithm. conference of the international speech communication association. pp. 2522- 2525 ,(2010)
Je Hun Jeon, Yang Liu, Syllable-level prominence detection with acoustic evidence. conference of the international speech communication association. pp. 1772- 1775 ,(2010)
John F. Pitrelli, Janet B. Pierrehumbert, Julia Hirschberg, Colin W. Wightman, Mary E. Beckman, Mari Ostendorf, Patti Price, Kim E. A. Silverman, TOBI: a standard for labeling English prosody. conference of the international speech communication association. ,(1992)
Julia Hockenmaier, Paul Ruhlen, Miles Osborne, Jeremiah Crim, Rebecca Hwa, Mark Steedman, Steven Baker, Anoop Sarkar, Stephen Clark, CLSP WS-02 Final Report: Semi-Supervised Training for Statistical Parsing ,(2003)
Shrikanth S. Narayanan, Sankaranarayanan Ananthakrishnan, Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling. conference of the international speech communication association. ,(2006)
Ümit Güz, Sébastien Cuendet, Dilek Z. Hakkani-Tür, Gökhan Tür, Co-training Using Prosodic and Lexical Information for Sentence Segmentation conference of the international speech communication association. pp. 2597- 2600 ,(2007) , 10.5072/ZENODO.47054
Ion Muslea, Craig A. Knoblock, Steven Minton, Active + Semi-supervised Learning = Robust Multi-View Learning international conference on machine learning. pp. 435- 442 ,(2002)
Dilek Z. Hakkani-Tur, Gokhan Tur, Exploiting unlabeled utterances for spoken language understanding conference of the international speech communication association. ,(2003)
Ion Muslea, Craig A. Knoblock, Steven Minton, Selective Sampling with Redundant Views national conference on artificial intelligence. pp. 621- 626 ,(2000)
Sally A. Goldman, Yan Zhou, Enhancing Supervised Learning with Unlabeled Data international conference on machine learning. pp. 327- 334 ,(2000)