作者: Tao Huang , Lei Chen , Yu-Dong Cai , Kuo-Chen Chou
DOI: 10.1371/JOURNAL.PONE.0025297
关键词: Pseudo amino acid composition 、 Computational biology 、 Graph property 、 Systems biology 、 Information processing 、 Feature (machine learning) 、 Bioinformatics 、 Regulatory Pathway 、 Property (philosophy) 、 Redundancy (engineering) 、 Biology
摘要: Given a regulatory pathway system consisting of set proteins, can we predict which class it belongs to? Such problem is closely related to the biological function in cells and hence quite fundamental essential systems biology proteomics. This also an extremely difficult challenging due its complexity. To address this problem, novel approach was developed that be used query pathways among following six functional categories: (i) “Metabolism”, (ii) “Genetic Information Processing”, (iii) “Environmental (iv) “Cellular Processes”, (v) “Organismal Systems”, (vi) “Human Diseases”. The prediction method established trough procedures: according general form pseudo amino acid composition (PseAAC), each concerned formulated as 5570-D (dimensional) vector; components vector derived by series feature extractions from graphic property, biochemical physicochemical well property; minimum redundancy maximum relevance (mRMR) adopted operate prediction. A cross-validation jackknife test on benchmark dataset 146 indicated overall success rate 78.8% achieved our identifying above classes, indicating outcome promising encouraging. best knowledge, current study represents first effort attempting identity type or function. It anticipated report may stimulate follow-up investigations new area.