Exploratory Visualization of Misclassified GPCRs from Their Transformed Unaligned Sequences Using Manifold Learning Techniques

Authors: Jesús Giraldo Arjonilla, Martha Ivón Cárdenas Domínguez, Alfredo Vellido Alcacena, Caroline König, René Alquézar Mancho

DOI:

Keywords: Pattern recognition, Relevance (information retrieval), Sequence, Nonlinear dimensionality reduction, Boundary (topology), Visualization, Phylogenetic tree, Data visualization, Computer science, Limit (mathematics), Machine learning, Artificial intelligence

Abstract: Class C G-protein-coupled receptors (GPCRs) are cell membrane proteins of great relevance to biology and pharmacology. Previous research has revealed an upper boundary on the accuracy that can be achieved in their classification into subtypes from transformations of their unaligned sequences. To investigate this, we focus on sequences that have been misclassified using supervised methods. These are visualized using a nonlinear dimensionality reduction technique and phylogenetic trees, and then characterized against the rest of the data and, particularly, against the rest of the cases of their own subtype. This should help to discriminate between different types of misclassification and to build hypotheses about database quality problems and the extent to which GPCR sequence transformations limit subtype discriminability. The reported experiments provide a proof of concept for the proposed method.
