Seminal quality prediction using data mining methods.

作者: Anoop J. Sahoo , Yugal Kumar

DOI: 10.3233/THC-140816

关键词:

摘要: Background Now-a-days, some new classes of diseases have come into existences which are known as lifestyle diseases. The main reasons behind these changes in the people such alcohol drinking, smoking, food habits etc. After going through various diseases, it has been found that fertility rates (sperm quantity) men considerably decreasing last two decades. Lifestyle factors well environmental mainly responsible for change semen quality. Objective objective this paper is to identify and features affects seminal quality also rate man using data mining methods. Method five artificial intelligence techniques Multilayer perceptron (MLP), Decision Tree (DT), Navie Bayes (Kernel), Support vector machine+Particle swarm optimization (SVM+PSO) machine (SVM) applied on dataset evaluate predict person either normal or having altered rate. While eight feature selection support (SVM), neural network (NN), evolutionary logistic regression (LR), plus particle (SVM+PSO), principle component analysis (PCA), chi-square test, correlation T-test methods used more relevant affect These contains 100 instances with nine attribute classes. Results experimental result shows SVM+PSO provides higher accuracy area under curve (AUC) (94% & 0.932) among multi-layer (MLP) (92% 0.728), Vector Machines (91% 0.758), (Kernel) (89% 0.850) 0.735) parameters. This focuses process i.e. how select important prediction In paper, find out a set good features. investigational results childish (0.079) high fever (0.057) less impact while age (0.8685), season (0.843), surgical intervention (0.7683), consumption (0.5992), smoking habit (0.575), number hours spent setting (0.4366) accident (0.5973) impact. It observed increase above mentioned (multilayer 92%, 91%, 94%, 89% decision tree 89%) compared without 86%, 85%, 83% 84%) applicability prediction. Conclusion lightens application medical domain. From can be concluded disease based parameters/features rather than undergoing test. provide accurate tree.

参考文章(35)
David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams, Learning representations by back-propagating errors Nature. ,vol. 323, pp. 696- 699 ,(1988) , 10.1038/323533A0
Yamuna Prasad, K. K. Biswas, PSO - SVM Based Classifiers: A Comparative Approach international conference on contemporary computing. pp. 241- 252 ,(2010) , 10.1007/978-3-642-14834-7_23
Christopher M. Bishop, Neural networks for pattern recognition ,(1995)
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
Yugal Kumar, G. Sahoo, Prediction of different types of liver diseases using rule based classification model Technology and Health Care. ,vol. 21, pp. 417- 432 ,(2013) , 10.3233/THC-130742
Nir Friedman, Dan Geiger, Moises Goldszmidt, Bayesian Network Classifiers Machine Learning. ,vol. 29, pp. 131- 163 ,(1997) , 10.1023/A:1007465528199
George H. John, Pat Langley, Estimating continuous distributions in Bayesian classifiers uncertainty in artificial intelligence. pp. 338- 345 ,(1995)
Aleksander Giwercman, Yvonne Lundberg Giwercman, Environmental factors and testicular function. Best Practice & Research Clinical Endocrinology & Metabolism. ,vol. 25, pp. 391- 402 ,(2011) , 10.1016/J.BEEM.2010.09.011