作者: Rong Zhang , A.I. Rudnicky
DOI: 10.1109/ICASSP.2006.1660047
关键词:
摘要: Current approaches to semi-supervised incremental learning prefer select unlabeled examples predicted with high confidence for model re-training. However, this strategy can degrade the classification performance rather than improve it. We present an analysis reasons of phenomenon, showing that only relying on data selection lead erroneous estimate true distribution when annotator is highly correlated classifier in information they use. propose a new approach address problem and apply it variety applications, including machine speech recognition. Encouraging improvements recognition accuracy are observed our experiments.