Clustering and combinatorial optimization in recursive supervised learning

作者: Kiruthika Ramanathan , Sheng Uei Guan

DOI: 10.1007/S10878-006-9017-5

关键词: Boosting (machine learning)Artificial neural networkCluster analysisEvolutionary algorithmOverfittingArtificial intelligenceTheory of computationMathematicsMachine learningSupervised learningCombinatorial optimization

摘要: The use of combinations weak learners to learn a dataset has been shown be better than the single strong learner. In fact, idea is so successful that boosting, an algorithm combining several for supervised learning, considered best off shelf classifier. However, some problems still exist, including determining optimal number and over fitting data. earlier work, we developed RPHP which solves both these by using combination global search, learning pattern distribution. this chapter, revise search component replacing it with cluster based combinatorial optimization. Patterns are clustered according output space problem, i.e., natural clusters formed on patterns belonging each class. A optimization problem therefore created, solved evolutionary algorithms. algorithms identify “easy” “difficult” in system. removal easy then gives way focused more complicated patterns. becomes recursively simpler. Over overcome set validation along distributor. An also proposed distributor determine recursions hence problem. Empirical studies show generally good performance when compared other state art methods.

参考文章(22)
Robert Tibshirani, Trevor Hastie, Jerome H. Friedman, The Elements of Statistical Learning ,(2001)
A. P. Engelbrecht, R. Brits, Supervised Training Using an Unsupervised Approach to Active Learning Neural Processing Letters. ,vol. 15, pp. 247- 260 ,(2002) , 10.1023/A:1015733517815
Byoung-Tak Zhang, Dong-Yeon Cho, Genetic Programming with Active Data Selection simulated evolution and learning. pp. 146- 153 ,(1998) , 10.1007/3-540-48873-1_20
Robert E. Schapire, A brief introduction to boosting international joint conference on artificial intelligence. ,vol. 2, pp. 1401- 1406 ,(1999)
James Franklin, The elements of statistical learning : data mining, inference,and prediction The Mathematical Intelligencer. ,vol. 27, pp. 83- 85 ,(2005) , 10.1007/BF02985802
Teuvo Kohonen, Self-Organizing Maps ,(1995)
M. Anthony Wong, Tom Lane, A Kth Nearest Neighbour Clustering Procedure ,(2011)
B-L Lu, K Ito, H Kita, Y Nishikawa, None, A parallel and modular multi-sieving neural network architecture for constructive learning 4th International Conference on Artificial Neural Networks. pp. 92- 97 ,(1995) , 10.1049/CP:19950535
A. K. Jain, M. N. Murty, P. J. Flynn, Data clustering: a review ACM Computing Surveys. ,vol. 31, pp. 264- 323 ,(1999) , 10.1145/331499.331504
A.K. Jain, Jianchang Mao, K.M. Mohiuddin, Artificial neural networks: a tutorial computational science and engineering. ,vol. 29, pp. 31- 44 ,(1996) , 10.1109/2.485891