Particle Swarm Optimization Based Swarm Intelligence for Active Learning Improvement: Application on Medical Data Classification

作者: Nawel Zemmal , Nabiha Azizi , Mokhtar Sellami , Soraya Cheriguene , Amel Ziani

DOI: 10.1007/S12559-020-09739-Z

关键词:

摘要: Semi-supervised learning targets the common situation where labeled data are scarce but unlabeled abundant. It uses to help supervised tasks. In practice, it may make sense utilize active in conjunction with semi-supervised learning. That is, we might allow algorithm pick a set of instances be by domain expert, which will then used as set. However, existing approaches computationally expensive and require searching through an entire dataset, contain redundant that provide no instructive information classifier can decrease performance. To address this optimization problem, hybrid system combines (AL) particle swarm (PSO) algorithms is proposed reduce cost labeling while building more efficient classifier. The novelty work resides integration bio-inspired machine strategy. Furthermore, novel uncertainty measure was integrated into objective function select from massive amounts medical those deemed most informative. evaluate effectiveness approach, eighteen (18) benchmark datasets were compared against three best-known classifiers different paradigms: AL–NB using Naive Base Margin Sampling strategy, SVM (Support Vector Machine), ELM (Extreme Learning Machine) learning, TSVM (Transductive Support Experiments showed approach effective reducing efforts required experts for annotation produce accurate has been utilized optimize task labeling. Based on measure, nature-inspired PSO attempts considered informative, at same time improving experiments carried out confirm strategy significantly enhances performance AL commonly strategies. achieves similar fully requiring much less As future extension work, would interesting integrate other evolutionary compare them our approach. addition, beneficial test impact variants Also, aimed classification experimentation process.

参考文章(71)
Jian Shang, Gang Ye, Ke Shi, Yushun Wan, Chuming Luo, Hideki Aihara, Qibin Geng, Ashley Auerbach, Fang Li, Structural basis of receptor recognition by SARS-CoV-2. Nature. ,vol. 581, pp. 221- 224 ,(2020) , 10.1038/S41586-020-2179-Y
Kevin Bache, Moshe Lichman, UCI Machine Learning Repository University of California, School of Information and Computer Science. ,(2007)
M. Heath, K. Bowyer, D. Kopans, P. Kegelmeyer, R. Moore, K. Chang, S. Munishkumaran, Current Status of the Digital Database for Screening Mammography Digital Mammography / IWDM. pp. 457- 460 ,(1998) , 10.1007/978-94-011-5318-8_75
D. Kopans, K. Bowyer, R. Moore, M. Heath, THE DIGITAL DATABASE FOR SCREENING MAMMOGRAPHY ,(2007)
Nicholas Roy, Andrew McCallum, Toward Optimal Active Learning through Sampling Estimation of Error Reduction international conference on machine learning. pp. 441- 448 ,(2001)
Simon Tong, Daphne Koller, Support Vector Machine Active Learning with Application sto Text Classification international conference on machine learning. pp. 999- 1006 ,(2000)
Ioannis Kapouleas, Sholom M. Weiss, An empirical comparison of pattern recognition, neural nets, and machine learning classification methods international joint conference on artificial intelligence. pp. 781- 787 ,(1989)
Nawel Zemmal, Nabiha Azizi, Mokhtar Sellami, CAD system for classification of mammographic abnormalities using transductive semi supervised learning algorithm and heterogeneous features 2015 12th International Symposium on Programming and Systems (ISPS). pp. 1- 9 ,(2015) , 10.1109/ISPS.2015.7244993
Axel-Cyrille Ngonga Ngomo, Klaus Lyko, EAGLE: Efficient Active Learning of Link Specifications Using Genetic Programming Lecture Notes in Computer Science. pp. 149- 163 ,(2012) , 10.1007/978-3-642-30284-8_17
Tobias Scheffer, Christian Decomain, Stefan Wrobel, Active Hidden Markov Models for Information Extraction intelligent data analysis. pp. 309- 318 ,(2001) , 10.1007/3-540-44816-0_31