A machine learning approach for gene expression analysis and applications

作者: Tom Altman , Thanh Ngoc Le , Katheleen Gardiner

DOI:

关键词:

摘要: High-throughput microarray technology is an important and revolutionary technique used in genomics systems biology to analyze the expression of thousands genes simultaneously. The popular use this has resulted enormous repositories data, for example, Gene Expression Omnibus (GEO), maintained by National Center Biotechnology Information (NCBI). However, effective approach optimally exploit these datasets support specific biological studies still lacking. Specifically, improved method required integrate data from multiple sources select only those that meet investigator's interest. In addition, full power determine relationships among selected interpret meanings behind relationships. To address requirements, we have developed a machine learning based includes: • An meta-analysis sources; exploits information regarding context interest provided biologists. A novel cluster analysis identify hidden patterns representing between under conditions motif finding discovers, not common transcription factor binding sites co-regulated genes, but also miRNA associated with conditions. learning-based framework web application run tasks on online.

参考文章(140)
Xiaolu Sturgeon, Thanh Le, Md. Mahiuddin Ahmed, Katheleen J. Gardiner, Pathways to cognitive deficits in Down syndrome. Progress in Brain Research. ,vol. 197, pp. 73- 100 ,(2012) , 10.1016/B978-0-444-54299-1.00005-4
Charles Elkan, Timothy L. Bailey, The value of prior knowledge in discovering motifs with MEME. intelligent systems in molecular biology. ,vol. 3, pp. 21- 29 ,(1995)
Dhammika Amaratunga, Javier Cabrera, Exploration and Analysis of DNA Microarray and Protein Array Data ,(2003)
Ronglai Shen, Debashis Ghosh, Arul M Chinnaiyan, Prognostic meta-signature of breast cancer developed by two-stage mixture modeling of microarray data BMC Genomics. ,vol. 5, pp. 94- 94 ,(2004) , 10.1186/1471-2164-5-94
Patrick Warnat, Roland Eils, Benedikt Brors, Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes. BMC Bioinformatics. ,vol. 6, pp. 265- 265 ,(2005) , 10.1186/1471-2105-6-265
Sir Ronald Aylmer Fisher, Statistical Methods for Research Workers ,(1925)
Stephen L. Chiu, Fuzzy Model Identification Based on Cluster Estimation Journal of Intelligent and Fuzzy Systems. ,vol. 2, pp. 267- 278 ,(1994) , 10.3233/IFS-1994-2306
Hongying Jiang, Youping Deng, Huann-Sheng Chen, Lin Tao, Qiuying Sha, Jun Chen, Chung-Jui Tsai, Shuanglin Zhang, Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes BMC Bioinformatics. ,vol. 5, pp. 81- 81 ,(2004) , 10.1186/1471-2105-5-81