作者: Andrew K. Rider , Reid A. Johnson , Darcy A. Davis , T. Ryan Hoens , Nitesh V. Chawla
DOI: 10.1007/978-3-642-41398-8_33
关键词: Machine learning 、 Mathematics 、 Artificial intelligence 、 Data mining 、 Classifier (UML)
摘要: The concept of a negative class does not apply to many problems for which classification is increasingly utilized. In this study we investigate the reliability evaluation metrics when contains an unknown proportion mislabeled positive instances. We examine how can inform us about potential systematic biases in data. provide motivating case and general framework approaching show that behavior unstable presence uncertainty labels stability depends on kind bias Finally, type amount present data have significant effect ranking degree they over- or underestimate true performance classifiers.