Predicting Seminal Quality via Imbalanced Learning with Evolutionary Safe-Level Synthetic Minority Over-Sampling Technique

作者: Jieming Ma , David Olalekan Afolabi , Jie Ren , Aiyan Zhen

DOI: 10.1007/S12559-019-09657-9

关键词:

摘要: Seminal quality has fallen dramatically over the past two decades. Research indicates that environmental factors, health status, and life habits might lead to decline. Prediction of seminal is very useful in early diagnosis infertile patients. Recently, artificial intelligence (AI) technologies have been applied study male fertility potential. As it common many real applications about cognitive computation, prediction faces problem class imbalance, conventional algorithms are often biased towards majority class. In this paper, an evolutionary safe-level synthetic minority over-sampling technique (ESLSMOTE) proposed synthesize instances along same line with different weight degree, called safe level. The profile individual from dataset predicted via three classification methods ESLSMOTE. Important indicators, such as accuracy, precision, recall, receiver operating characteristic (ROC) curve, F1-score, used evaluate performance classifiers ESLSMOTE based on a tenfold cross-validation scheme. experimental results show can significantly improve accuracy back-propagation neural network, adaptive boosting, support vector machine. highest area under ROC curve (97.2%) given by ESLSMOTE-AdaBoost model. Experimental indicate ESLSMOTE-based outperform current state-of-the-art predicting terms curve. such, capability high accuracy.

参考文章(45)
Stan Matwin, Miroslav Kubat, Addressing the Curse of Imbalanced Training Sets: One-Sided Selection. international conference on machine learning. pp. 179- 186 ,(1997)
Chumphol Bunkhumpornpat, Krung Sinapiromsaran, Chidchanok Lursinsap, Safe-Level-SMOTE: Safe-Level-Synthetic Minority Over-Sampling TEchnique for Handling the Class Imbalanced Problem Advances in Knowledge Discovery and Data Mining. pp. 475- 482 ,(2009) , 10.1007/978-3-642-01307-2_43
N. Jørgensen, M. Vierula, R. Jacobsen, E. Pukkala, A. Perheentupa, H. E. Virtanen, N. E. Skakkebaek, J. Toppari, Recent adverse trends in semen quality and testis cancer incidence among Finnish men International Journal of Andrology. ,vol. 34, ,(2011) , 10.1111/J.1365-2605.2010.01133.X
Anoop J. Sahoo, Yugal Kumar, Seminal quality prediction using data mining methods. Technology and Health Care. ,vol. 22, pp. 531- 545 ,(2014) , 10.3233/THC-140816
Wei Xia, Chang-Hong Zhu, Cheng-Liang Xiong, Meng Rao, Tian-Qing Meng, Si-Heng Hu, Huang-Tao Guan, Qin-Yu Wei, Evaluation of semen quality in 1808 university students, from Wuhan, Central China. Asian Journal of Andrology. ,vol. 17, pp. 111- 116 ,(2015) , 10.4103/1008-682X.135984
Ruifeng Xu, Tao Chen, Yunqing Xia, Qin Lu, Bin Liu, Xuan Wang, Word Embedding Composition for Data Imbalances in Sentiment and Emotion Classification Cognitive Computation. ,vol. 7, pp. 226- 240 ,(2015) , 10.1007/S12559-015-9319-Y
Krung Sinapiromsaran, Chidchanok Lursinsap, Chumphol Bunkhumpornpat, DBSMOTE: Density-Based Synthetic Minority Over-sampling TEchnique Applied Intelligence. ,vol. 36, pp. 664- 684 ,(2012) , 10.1007/S10489-011-0287-Y
A I Marqués, V García, J S Sánchez, On the suitability of resampling techniques for the class imbalance problem in credit scoring Journal of the Operational Research Society. ,vol. 64, pp. 1060- 1070 ,(2013) , 10.1057/JORS.2012.120
Grazia Petrelli, Alberto Mantovani, Environmental risk factors and male fertility and reproduction Contraception. ,vol. 65, pp. 297- 300 ,(2002) , 10.1016/S0010-7824(02)00298-6
Javier Romero-Otero, José Medina-Polo, Borja García-Gómez, David Lora-Pablos, José Manuel Duarte-Ojeda, Lucía García-González, Eduardo García-Cruz, Alfredo Rodríguez-Antolín, Semen Quality Assessment in Fertile Men in Madrid During the Last 3 Decades Urology. ,vol. 85, pp. 1333- 1338 ,(2015) , 10.1016/J.UROLOGY.2015.02.001