Integer programming models and algorithms for molecular classification of cancer from microarray data

作者: Pablo Moscato , Regina Berretta , Alexandre Mendes

DOI:

关键词:

摘要: Novel, high-throughput technologies are challenging the core of algorithmic methods available in Computer Science. Microarray give Life Sciences researchers opportunity to simultaneously measure thousands gene expression levels under different conditions or coming from cell lines. With appropriate data mining models and algorithms, this would lead a systematic exploration molecular classification cancer, just one among many other exciting applications. The aim paper is present unified mathematical formalization for feature selection problems investigate their performance cancer cell-lines. We also some results using NCI60 dataset.

参考文章(14)
Carlos Cotta, Alexandre Mendes, Vinícius Garcia, Paulo França, Pablo Moscato, Applying memetic algorithms to the analysis of microarray data Lecture Notes in Computer Science. pp. 22- 32 ,(2003) , 10.1007/3-540-36605-9_3
Carlos Cotta, Christian Sloper, Pablo Moscato, Evolutionary Search of Thresholds for Robust Feature Set Selection: Application to the Analysis of Microarray Data Lecture Notes in Computer Science. pp. 21- 30 ,(2004) , 10.1007/978-3-540-24653-4_3
J. Marx, MEDICINE: DNA Arrays Reveal Cancer in Its Many Forms Science. ,vol. 289, pp. 1670- 1672 ,(2000) , 10.1126/SCIENCE.289.5485.1670
Alberto Caprara, Paolo Toth, Matteo Fischetti, Algorithms for the Set Covering Problem Annals of Operations Research. ,vol. 98, pp. 353- 371 ,(2000) , 10.1023/A:1019225027893
Douglas T Ross, Uwe Scherf, Michael B Eisen, Charles M Perou, Christian Rees, Paul Spellman, Vishwanath Iyer, Stefanie S Jeffrey, Matt Van de Rijn, Mark Waltham, Alexander Pergamenschikov, JC Lee, Deval Lashkari, Dari Shalon, Timothy G Myers, John N Weinstein, David Botstein, Patrick O Brown, None, Systematic variation in gene expression patterns in human cancer cell lines. Nature Genetics. ,vol. 24, pp. 227- 235 ,(2000) , 10.1038/73432
HUIQING LIU, LIMSOON WONG, Data mining tools for biological sequences. Journal of Bioinformatics and Computational Biology. ,vol. 1, pp. 139- 167 ,(2003) , 10.1142/S0219720003000216
Carlos Cotta, Pablo Moscato, The k -feature set problem is W [2]-complete Journal of Computer and System Sciences. ,vol. 67, pp. 686- 690 ,(2003) , 10.1016/S0022-0000(03)00081-3
Lei Yu, Huan Liu, Redundancy based feature selection for microarray data knowledge discovery and data mining. pp. 737- 742 ,(2004) , 10.1145/1014052.1014149
Trond Hellem Bø, Bjarte Dysvik, Inge Jonassen, LSimpute: accurate estimation of missing values in microarray data with least squares methods. Nucleic Acids Research. ,vol. 32, ,(2004) , 10.1093/NAR/GNH026