Novel Data Mining Techniques in aCGH based Breast Cancer Subtypes Profiling: the Biological Perspective

作者: F. Menolascina , S. Tommasi , A. Paradiso , M. Cortellino , V. Bevilacqua

DOI: 10.1109/CIBCB.2007.4221198

关键词: Artificial intelligenceProfiling (information science)Computational intelligenceNaive Bayes classifierC4.5 algorithmGene expression programmingRule inductionDecision treeComputer scienceData miningMachine learningIntelligent decision support system

摘要: In this paper we present a comparative study among well established data mining algorithm (namely J48 and naive Bayes tree) novel machine learning paradigms like ant miner gene expression programming. The aim of was to discover significant rules discriminating ER+ ER-cases breast cancer. We compared both statistical accuracy biological validity the results using common methods ontology. Some worth noting characteristics these systems have been observed analysed even giving some possible interpretations findings. With tried show how intelligent can be employed in design experimental pipeline disease processes investigation deriving high-throughput validated new computational tools. Results returned by approach seem encourage efforts field

参考文章(24)
Goutham Kurra, Raj Bhatnagar, Wen Niu, Mining microarray expression data for classifier gene-cores international conference on data mining. pp. 8- 14 ,(2001)
Daniel Pinkel, Richard Segraves, Damir Sudar, Steven Clark, Ian Poole, David Kowbel, Colin Collins, Wen-Lin Kuo, Chira Chen, Ye Zhai, Shanaz H. Dairkee, Britt-marie Ljung, Joe W. Gray, Donna G. Albertson, High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays Nature Genetics. ,vol. 20, pp. 207- 211 ,(1998) , 10.1038/2524
Vanathi Gopalakrishnan, Philip Ganchev, Srikanth Ranganathan, Robert Bowser, Rule learning for disease-specific biomarker discovery from clinical proteomic mass spectra international conference on data mining. pp. 93- 105 ,(2006) , 10.1007/11691730_10
Cândida Ferreira, Gene Expression Programming: A New Adaptive Algorithm for Solving Problems. Complex Systems. ,vol. 13, ,(2001)
Amandeep S Sidhu, Paul J Kennedy, Simeon Simoff, Tharam S Dillon, Elizabeth Chang, Knowledge Discovery in Biomedical Data Facilitated by Domain Ontologies knowledge discovery and data mining. pp. 189- 201 ,(2007) , 10.4018/978-1-59904-252-7.CH010
Edward R. Dougherty, Edward R. Dougherty, Marcel Brun, On the number of close-to-optimal feature sets. Cancer Informatics. ,vol. 2, pp. 189- 196 ,(2006) , 10.4137/CIN.S0
Atul J. Butte, Alvin Kho, Isaac S. Kohane, Microarrays for an Integrative Genomics ,(2002)
Allen Chan, Alex A. Freitas, A new ant colony algorithm for multi-label classification with applications in bioinfomatics Proceedings of the 8th annual conference on Genetic and evolutionary computation - GECCO '06. pp. 27- 34 ,(2006) , 10.1145/1143997.1144002
Fulvia Ferrazzi, Roberta Amici, Paola Sebastiani, Isaac Kohane, Marco Ramoni, Riccardo Bellazzi, Can we use linear Gaussian networks to model dynamic interactions among genes? Results from a simulation study international conference on bioinformatics. pp. 13- 14 ,(2006) , 10.1109/GENSIPS.2006.353132
T. R. Golub, D. K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J. P. Mesirov, H. Coller, M. L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, E. S. Lander, Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science. ,vol. 286, pp. 531- 537 ,(1999) , 10.1126/SCIENCE.286.5439.531