Evaluation of inductive and transductive inference in the context of translation initiation site

作者: Wallison W. Guimarães , Cristiano L. N. Pinto , Cristiane N. Nobre , Luis E. Zárate

DOI: 10.1145/3167132.3167368

关键词: Context (language use)Class (biology)Transduction (machine learning)Translation initiation sitesCaenorhabditis elegansUpstream (networking)Computer scienceSupport vector machineInductive reasoningComputational biology

摘要: The prediction of Translation Initiation Site (TIS) from a mRNA (Ribonucleic Acid Messenger) is relevant and latent problem molecular biology, which has benefited the evolution computational techniques machine learning (ML). There are some scenarios where dataset either does not have enough classified sequences to train precise model, or it an upstream region, such as Caenorhabditis elegans. In this article, we compare inductive transductive approaches for TIS prediction, using methodology that disregards region. With proposed methodology, achieved 95% training accuracy, only 2.5% belonging elegans class, many available but 75% Rattus norvegicus fewer available, approach. Our results demonstrate viability approach with sequences, common situation organisms incomplete gene sequencing.

参考文章(8)
George Tzanis, Christos Berberidis, Ioannis Vlahavas, A novel data mining approach for the accurate prediction of translation initiation sites international conference on biological and medical data analysis. pp. 92- 103 ,(2006) , 10.1007/11946465_9
Evgeny Kondratovich, Igor I. Baskin, Alexandre Varnek, Transductive Support Vector Machines: Promising Approach to Model Small and Unbalanced Datasets Molecular Informatics. ,vol. 32, pp. 261- 266 ,(2013) , 10.1002/MINF.201200135
S. Nakagawa, Y. Niimura, T. Gojobori, H. Tanaka, K.-i. Miura, Diversity of preferred nucleotide sequences around the translation initiation codon in eukaryote genomes Nucleic Acids Research. ,vol. 36, pp. 861- 871 ,(2007) , 10.1093/NAR/GKM1102
A. G. Hatzigeorgiou, Translation initiation start prediction in human cDNAs with high accuracy. Bioinformatics. ,vol. 18, pp. 343- 350 ,(2002) , 10.1093/BIOINFORMATICS/18.2.343
Chih-Chung Chang, Chih-Jen Lin, LIBSVM ACM Transactions on Intelligent Systems and Technology. ,vol. 2, pp. 1- 27 ,(2011) , 10.1145/1961189.1961199
Cristiano Lacerda Nunes Pinto, Cristiane Neri Nobre, Luis Enrique Zárate, Transductive learning as an alternative to translation initiation site identification BMC Bioinformatics. ,vol. 18, pp. 81- 81 ,(2017) , 10.1186/S12859-017-1502-6