Training with Input Selection and Testing (TWIST) Algorithm: A Significant Advance in Pattern Recognition Performance of Machine Learning

作者: Massimo Buscema , Marco Breda , Weldon Lodwick

DOI: 10.4236/JILSA.2013.51004

关键词: Artificial neural networkComputer sciencePattern recognitionVariablesMachine learningProbability density functionAlgorithmCross-validationCoding (social sciences)SoftwareEvolutionary computationArtificial intelligenceTest dataData mining

摘要: This article shows the efficacy of TWIST, a methodology for design training and testing data subsets extracted from given dataset associated with problem to be solved via ANNs. The we present is embedded in algorithms actualized computer software. Our as implemented software compared current standard methods random cross validation: 10-Fold CV, split into two more advanced T&T. For each strategy, 13 learning machines, representing different families main algorithms, have been trained tested. All were using well-known WEKA package. On one hand falsification test randomly distributed dependent variable has used show how T&T TWIST behaves other strategies: when there no information available on datasets they are equivalent. hand, real Statlog (Heart) dataset, strong difference accuracy experimentally proved. results that superior methods. Pairs similar probability density functions generated, without coding noise, according an optimal strategy extracts most useful pattern classification.

参考文章(41)
N. Chaikla, Yulu Qi, Genetic algorithms in feature selection systems man and cybernetics. ,vol. 5, pp. 538- 540 ,(1999) , 10.1109/ICSMC.1999.815609
Andreas Moser, M. Narasimha Murty, On the Scalability of Genetic Algorithms to Very Large-Scale Feature Selection Real-World Applications of Evolutionary Computing, EvoWorkshops 2000: EvoIASP, EvoSCONDI, EvoTel, EvoSTIM, EvoROB, and EvoFlight. pp. 77- 86 ,(2000) , 10.1007/3-540-45561-2_8
George H John, Ron Kohavi, Karl Pfleger, None, Irrelevant Features and the Subset Selection Problem Machine Learning Proceedings 1994. pp. 121- 129 ,(1994) , 10.1016/B978-1-55860-335-6.50023-4
S. Penco, E. Grossi, S. Cheng, M. Intraligi, G. Maurelli, M. C. Patrosso, A. Marocchi, M. Buscema, Assessment of the Role of Genetic Polymorphism in Venous Thrombosis Through Artificial Neural Networks Annals of Human Genetics. ,vol. 69, pp. 693- 706 ,(2005) , 10.1111/J.1529-8817.2005.00206.X
Maria E Street, Enzo Grossi, Cecilia Volta, Elena Faleschini, Sergio Bernasconi, Placental determinants of fetal growth: identification of key factors in the insulin-like growth factor and cytokine systems using artificial neural networks BMC Pediatrics. ,vol. 8, pp. 24- 24 ,(2008) , 10.1186/1471-2431-8-24
MTW, Huan Liu, Hiroshi Motoda, Feature Extraction, Construction and Selection: A Data Mining Perspective Journal of the American Statistical Association. ,vol. 94, pp. 1390- ,(1998) , 10.2307/2669967
Silvana Penco, Massimo Buscema, Maria Cristina Patrosso, Alessandro Marocchi, Enzo Grossi, New application of intelligent agents in sporadic amyotrophic lateral sclerosis identifies unexpected specific genetic background BMC Bioinformatics. ,vol. 9, pp. 254- 254 ,(2008) , 10.1186/1471-2105-9-254
Fabio Pace, Gabriele Riegler, Annalisa de Leone, Marina Pace, Renzo Cestari, Patrizia Dominici, Enzo Grossi, EMERGE Study Group, Is it possible to clinically differentiate erosive from nonerosive reflux disease patients? A study using an artificial neural networks-assisted algorithm European Journal of Gastroenterology & Hepatology. ,vol. 22, pp. 1163- 1168 ,(2010) , 10.1097/MEG.0B013E32833A88B8