作者: Jesús Giraldo Arjonilla , Martha Ivón Cárdenas Domínguez , Alfredo Vellido Alcacena , Caroline König , René Alquézar Mancho
DOI:
关键词: Pattern recognition 、 Relevance (information retrieval) 、 Sequence 、 Nonlinear dimensionality reduction 、 Boundary (topology) 、 Visualization 、 Phylogenetic tree 、 Data visualization 、 Computer science 、 Limit (mathematics) 、 Machine learning 、 Artificial intelligence
摘要: Class C G-protein-coupled receptors (GPCRs) are cell mem- brane proteins of great relevance to biology and pharmacology. Previous research has revealed an upper boundary on the accuracy that can be achieved in their classification into subtypes from unaligned transfor- mation sequences. To investigate this, we focus sequences have been misclassified using supervised methods. These visualized, a nonlinear dimensionality reduction technique phylogenetic trees, then characterized against rest data and, partic- ularly, cases own subtype. This should help discriminate between different types misclassification build hypotheses about database quality problems extent which GPCR sequence transformations limit subtype discriminability. The re- ported experiments provide proof concept for proposed method.