A Case Study for Learning from Imbalanced Data Sets

作者: Aijun An , Nick Cercone , Xiangji Huang

DOI: 10.1007/3-540-45153-6_1

关键词:

摘要: We present our experience in applying a rule induction technique to an extremely imbalanced pharmaceutical data set. focus on using variety of performance measures evaluate number quality measures. also investigate whether simply changing the distribution skew training can improve predictive performance. Finally, we propose method for adjusting learning algorithm environment. Our experimental results show that this adjustment improves formulas which coverage makes positive contributions value.

参考文章(12)
Stan Matwin, Miroslav Kubat, Addressing the Curse of Imbalanced Training Sets: One-Sided Selection. international conference on machine learning. pp. 179- 186 ,(1997)
DENNIS KIBLER, DAVID W. AHA, Learning Representative Exemplars of Concepts: An Initial Case Study Proceedings of the Fourth International Workshop on MACHINE LEARNING#R##N#June 22–25, 1987 University of California, Irvine. pp. 24- 30 ,(1987) , 10.1016/B978-0-934613-41-5.50006-4
Donald Michie, Expert systems in the micro-electronic age Edinburgh University Press. ,(1979)
Claire Cardie, Nicholas Nowe, Improving Minority Class Prediction Using Case-Specific Feature Weights international conference on machine learning. pp. 57- 65 ,(1997)
Richard Duda, John Gaschnig, Peter Hart, MODEL DESIGN IN THE PROSPECTOR CONSULTANT SYSTEM FOR MINERAL EXPLORATION Morgan Kaufmann. pp. 334- 348 ,(1981) , 10.1016/B978-0-934613-03-3.50028-3
Aijun An, Nick Cercone, ELEM2: A Learning System for More Accurate Classifications Lecture Notes in Computer Science. pp. 426- 441 ,(1998) , 10.1007/3-540-64575-6_68
Miroslav Kubat, Robert C. Holte, Stan Matwin, Machine Learning for the Detection of Oil Spills in Satellite Radar Images Machine Learning. ,vol. 30, pp. 195- 215 ,(1998) , 10.1023/A:1007452223027
Miroslav Kubat, Robert Holte, Stan Matwin, Learning When Negative Examples Abound european conference on machine learning. pp. 146- 153 ,(1997) , 10.1007/3-540-62858-4_79
Aijun An, Nick Cercone, Rule Quality Measures Improve the Accuracy of Rule Induction: An Experimental Approach international syposium on methodologies for intelligent systems. pp. 119- 129 ,(2000) , 10.1007/3-540-39963-1_13
Michael J. Turmon, Gholamreza Nakhaeizadeh, Charles C. Taylor, Machine learning and statistics: the interface Journal of the American Statistical Association. ,vol. 93, pp. 833- ,(1996) , 10.2307/2670132