SignS: a parallelized, open-source, freely available, web-based tool for gene selection and molecular signatures for survival and censored data.

作者: Ramon Diaz-Uriarte

DOI: 10.1186/1471-2105-9-30

关键词:

摘要: Censored data are increasingly common in many microarray studies that attempt to relate gene expression patient survival. Several new methods have been proposed the last two years. Most of these methods, however, not available biomedical researchers, leading re-implementations from scratch ad-hoc, and suboptimal, approaches with survival data. We developed SignS (Signatures for Survival data), an open-source, freely-available, web-based tool R package selection, building molecular signatures, prediction implements four which, according existing reviews, perform well and, by being a very different nature, offer complementary approaches. use parallel computing via MPI, large decreases user waiting time. Cross-validation is used asses predictive performance stability solutions, latter issue increasing concern given there often several solutions similar performance. Biological interpretation results enhanced because genes signatures models can be sent other freely-available on-line tools examination PubMed references, GO terms, KEGG Reactome pathways selected genes. first analysis data, one few researchers as target users. also bioinformatics applications extensively parallelization, including fault tolerance crash recovery. Because its combination implemented, usage computing, code availability, links additional bases, unique tool, will immediate relevance biostatisticians bioinformaticians.

参考文章(48)
Torsten Hothorn, Peter Bühlmann, Boosting Algorithms: Regularization, Prediction and Model Fitting. Rejoinder. Statistical Science. pp. 516- 522 ,(2007)
Sandrine Dudoit, Jane Fridlyand, Classification in microarray experiments Chapman and Hall/CRC. ,(2003) , 10.1201/9780203011232.CH3
Cameron Hughes, Tracy Hughes, Parallel and distributed programming using C ,(2003)
Joaquin Dopazo, Francisco Azuaje, Data Analysis and Visualization in Genomics and Proteomics John Wiley. ,(2005)
Sandrine Dudoit, Robert C. Gentleman, John Quackenbush, Open source software for the analysis of microarray data. BioTechniques. ,vol. 34, pp. 45- 51 ,(2003) , 10.2144/MAR03DUDOIT
Erricos John Kontoghiorghes, Handbook of Parallel Computing and Statistics ,(2020)
Ramón Díaz-Uriarte, Supervised Methods with Genomic Data: a Review and Cautionary View Data Analysis and Visualization in Genomics and Proteomics. pp. 193- 214 ,(2005) , 10.1002/0470094419.CH12
Ramón Díaz-Uriarte, Sara Alvarez de Andrés, Gene selection and classification of microarray data using random forest BMC Bioinformatics. ,vol. 7, pp. 3- 3 ,(2006) , 10.1186/1471-2105-7-3