Linear Discriminant Analysis-Based Estimation of the False Discovery Rate for Phosphopeptide Identifications†

Authors: Xiuxia Du, Feng Yang, Nathan P Manes, David L Stenoien, Matthew E Monroe

DOI: 10.1021/PR070510T

Abstract: The development of liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has made it possible to characterize phosphopeptides in an increasingly large-scale and high-throughput fashion. However, extracting confident phosphopeptide identifications from the resulting large data sets in a similar high-throughput fashion remains difficult, as does rigorously estimating the false discovery rate (FDR) of a set of phosphopeptide identifications. This article describes a data analysis pipeline designed to address these issues. The first step is to reanalyze phosphopeptide identifications that contain ambiguous assignments for the incorporated phosphate(s) to determine the most likely arrangement of the phosphate(s). The next step is to employ an expectation maximization algorithm to estimate the joint distribution of the peptide scores. A linear discriminant analysis is then performed to determine how to optimally combine the peptide scores (in this case, from SEQUEST) into a discriminant score that possesses the maximum discriminating power. Based on this discriminant score, the p- and q-values for each phosphopeptide identification are calculated, and the phosphopeptide identification FDR is then estimated. This data analysis approach was applied to data from a study of irradiated human skin fibroblasts to provide a robust estimate of the FDR for phosphopeptides. The Phosphopeptide FDR Estimator software is freely available for download at http://ncrr.pnl.gov/software/.
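
The discriminant-score and q-value steps named in the abstract can be illustrated with a minimal Python sketch. It is not the code of the software described above, and it rests on several assumptions that are not in the source: each identification carries exactly two scores (hypothetically named XCorr and DeltaCn here), the correct/incorrect labels are already known (the article instead infers the score distributions with expectation maximization), and the proportion of true nulls `pi0` is fixed at a conservative 1.0. All function names are illustrative.

```python
from statistics import mean


def fisher_direction(correct, incorrect):
    """Fisher linear discriminant weights w = Sw^-1 (mu_correct - mu_incorrect).

    Each argument is a list of (score_1, score_2) tuples; the two scores
    are assumed here to be XCorr and DeltaCn (an illustrative choice).
    """
    def centroid(points):
        return (mean(x for x, _ in points), mean(y for _, y in points))

    def scatter(points, mu):
        # Entries of the symmetric 2x2 scatter matrix [[sxx, sxy], [sxy, syy]].
        sxx = sum((x - mu[0]) ** 2 for x, _ in points)
        syy = sum((y - mu[1]) ** 2 for _, y in points)
        sxy = sum((x - mu[0]) * (y - mu[1]) for x, y in points)
        return sxx, sxy, syy

    mu_c, mu_i = centroid(correct), centroid(incorrect)
    a1, b1, d1 = scatter(correct, mu_c)
    a2, b2, d2 = scatter(incorrect, mu_i)
    a, b, d = a1 + a2, b1 + b2, d1 + d2  # pooled within-class scatter
    det = a * d - b * b
    if det == 0:
        raise ValueError("within-class scatter matrix is singular")
    dx, dy = mu_c[0] - mu_i[0], mu_c[1] - mu_i[1]
    # Apply the closed-form 2x2 inverse to the difference of class means.
    return ((d * dx - b * dy) / det, (a * dy - b * dx) / det)


def discriminant_score(w, scores):
    """Project one (score_1, score_2) pair onto the discriminant direction."""
    return w[0] * scores[0] + w[1] * scores[1]


def q_values(p_values, pi0=1.0):
    """Step-up q-values from p-values.

    With pi0 = 1.0 (assumed, conservative) this matches the
    Benjamini-Hochberg adjustment; a data-driven pi0 would be smaller.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    q = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so each q-value is the minimum
    # estimated FDR over all thresholds that would accept this p-value.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pi0 * m * p_values[i] / rank)
        q[i] = running_min
    return q
```

In such a sketch, identifications would be accepted when their q-value falls below a chosen threshold, and that threshold is then the estimated FDR of the accepted set. How p-values are derived from the null distribution of the discriminant score is left out here because the abstract does not specify it.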
