Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features

Authors: Yuan Li, Mingjun Wang, Huilin Wang, Hao Tan, Ziding Zhang

DOI: 10.1038/SREP05765

Abstract: Lysine acetylation is a reversible post-translational modification that plays an important role in cytokine signaling, transcriptional regulation, and apoptosis. To fully understand acetylation mechanisms, identification of substrates and specific acetylation sites is crucial. Experimental identification is often time-consuming and expensive. Alternative bioinformatics methods are cost-effective and can be used in a high-throughput manner to generate relatively precise predictions. Here we develop a method termed SSPKA for species-specific lysine acetylation prediction, using random forest classifiers that combine sequence-derived and functional features with two-step feature selection. Feature importance analysis indicates that functional features, applied to lysine acetylation site prediction for the first time, significantly improve the predictive performance. We apply the SSPKA model to screen the entire human proteome and identify many high-confidence putative substrates not previously identified. The results, along with the implemented Java tool, serve as useful resources to elucidate the mechanism of lysine acetylation and to facilitate hypothesis-driven experimental design and validation.
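
As a rough illustration of what "sequence-derived features" can mean for site prediction, the sketch below extracts a fixed-length window around each lysine in a protein sequence and computes a simple amino acid composition vector for it. This is not the SSPKA implementation: the function names, the flank size of 6, the `X` padding character, and the choice of composition as the feature are all assumptions made for illustration, and the abstract does not specify any of them.

```python
# Hypothetical sketch: lysine-centered windows and a simple composition feature.
# Window size, padding, and feature choice are illustrative assumptions,
# not details taken from the SSPKA paper.

STANDARD_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def lysine_windows(sequence, flank=6, pad="X"):
    """Return (position, window) pairs for every lysine (K) in a sequence.

    position is 1-based; each window has length 2 * flank + 1 with the
    lysine at its center. Ends of the sequence are padded with `pad`.
    """
    seq = sequence.upper()
    padded = pad * flank + seq + pad * flank
    windows = []
    for i, residue in enumerate(seq):
        if residue == "K":
            # In padded coordinates the residue sits at i + flank, so the
            # window is padded[i : i + 2 * flank + 1].
            windows.append((i + 1, padded[i:i + 2 * flank + 1]))
    return windows


def amino_acid_composition(window, alphabet=STANDARD_AMINO_ACIDS):
    """Fraction of each standard amino acid in a window, ignoring padding."""
    residues = [r for r in window if r in alphabet]
    total = len(residues) or 1  # avoid division by zero for all-padding input
    return [residues.count(a) / total for a in alphabet]


if __name__ == "__main__":
    for position, window in lysine_windows("MKTAYIAKQR", flank=3):
        features = amino_acid_composition(window)
        print(position, window, round(sum(features), 3))
```

Feature vectors of this kind, one per candidate lysine, are the sort of input a random forest classifier could be trained on. The functional features and the two-step feature selection mentioned in the abstract are not sketched here.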
