Parallel Perceptrons, Activation Margins and Imbalanced Training Set Pruning

作者: Iván Cantador , José R. Dorronsoro

DOI: 10.1007/11492542_6

关键词: Classifier (linguistics)Committee machinePerceptronPattern recognitionArtificial neural networkClassifier (UML)Machine learningComputer scienceArtificial intelligenceImage processingTraining set

摘要: A natural way to deal with training samples in imbalanced class problems is prune them removing redundant patterns, easy classify and probably over represented, label noisy patterns that belonging one are labelled as members of another. This allows classifier construction focus on borderline likely be the most informative ones. To appropriately define above subsets, this work we will use base classifiers so–called parallel perceptrons, a novel approach committee machine allows, among other things, naturally margins for hidden unit activations. We shall these pattern types iteratively perform subsample selections an initial set enhance classification accuracy allow balanced performance even when sizes greatly different.

参考文章(16)
Stan Matwin, Miroslav Kubat, Addressing the Curse of Imbalanced Training Sets: One-Sided Selection. international conference on machine learning. pp. 179- 186 ,(1997)
Peter Auer, Harald Burgsteiner, Wolfgang Maass, Reducing Communication for Distributed Learning in Neural Networks international conference on artificial neural networks. pp. 123- 128 ,(2002) , 10.1007/3-540-46084-5_21
Tom Fawcett, Foster Provost, Adaptive Fraud Detection Data Mining and Knowledge Discovery. ,vol. 1, pp. 291- 316 ,(1997) , 10.1023/A:1009700419189
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
David C. Sterratt, Arjen van Ooyen, Does Morphology Influence Temporal Plasticity international conference on artificial neural networks. pp. 186- 191 ,(2002) , 10.1007/3-540-46084-5_31
J. Swets, Measuring the accuracy of diagnostic systems Science. ,vol. 240, pp. 1285- 1293 ,(1988) , 10.1126/SCIENCE.3287615
Y. Freund, Boosting a weak learning algorithm by majority Information & Computation. ,vol. 121, pp. 256- 285 ,(1995) , 10.1006/INCO.1995.1136
C. L. Blake, UCI Repository of machine learning databases www.ics.uci.edu/〜mlearn/MLRepository.html. ,(1998)