An integrative approach to identifying biologically relevant genes

作者: Huan Liu , Nitin Agarwal , Shashvata Sharma , Yung Chang , Zheng Zhao

DOI:

关键词:

摘要: Gene selection aims at detecting biologically relevant genes to assist biologists’ research. The cDNA Microarray data used in gene is usually “wide”. With more than several thousand genes, but only less a hundred of samples, many irrelevant can gain their statistical relevance by sheer randomness. Addressing this problem goes beyond what the offer and necessitates use additional information. Recent developments bioinformatics have made various knowledge sources available, such as KEGG pathway repository Ontology database. Integrating different types could provide information about samples. In work, we propose novel approach integrate for identifying genes. converts external its internal knowledge, which be rank Upon obtaining ranking lists, it aggregates them via probabilistic model generates final list. Experimental results from our study on acute lymphoblastic leukemia demonstrate efficacy proposed show that using together help detect

参考文章(43)
Anke van Zuylen, Frans Schalekamp, Rank aggregation: together we're strong algorithm engineering and experimentation. pp. 38- 51 ,(2009)
Robert Tibshirani, Trevor Hastie, Jerome H. Friedman, The Elements of Statistical Learning ,(2001)
P. C. Mahalanobis, On the generalized distance in statistics Proceedings of the National Institute of Sciences (Calcutta). ,vol. 2, pp. 49- 55 ,(1936)
Pavel Pevzner, Neil C. Jones, An Introduction to Bioinformatics Algorithms ,(2004)
Hua Yu, Richard Jove, The STATs of cancer — new molecular targets come of age Nature Reviews Cancer. ,vol. 4, pp. 97- 105 ,(2004) , 10.1038/NRC1275
Christopher M. Bishop, Pattern Recognition and Machine Learning ,(2006)
Alexander J Smola, Risi Kondor, None, Kernels and Regularization on Graphs Learning Theory and Kernel Machines. pp. 144- 158 ,(2003) , 10.1007/978-3-540-45167-9_12
Catia Pesquita, Daniel Faria, Hugo Bastos, António EN Ferreira, André O Falcão, Francisco M Couto, Metrics for GO based protein semantic similarity: a systematic evaluation BMC Bioinformatics. ,vol. 9, pp. 1- 16 ,(2008) , 10.1186/1471-2105-9-S5-S4
S.J. Raudys, A.K. Jain, Small sample size effects in statistical pattern recognition: recommendations for practitioners IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 13, pp. 252- 264 ,(1991) , 10.1109/34.75512