Peptide sequence tags for fast database search in mass-spectrometry

Authors: Ari Frank, Stephen Tanner, Pavel Pevzner

DOI: 10.1007/11415770_25

Abstract: Filtration techniques, in the form of rapid elimination of candidate sequences while retaining the true one, are key ingredients of database searches in genomics. Although SEQUEST and Mascot are sometimes referred to as “BLAST for mass-spectrometry”, the algorithmic idea of BLAST (filtration) was never implemented in these tools. As a result, MS/MS protein identification tools are becoming too time-consuming for many applications, including the search for post-translationally modified peptides. Moreover, matching millions of spectra against all known proteins will soon make these tools too slow, in the same way that “genome vs. genome” comparisons instantly made BLAST too slow. We describe the development of filters that dramatically reduce the running time and effectively remove the bottlenecks in searching the huge space of modifications. Our approach, based on a probability model for determining the accuracy of sequence tags, achieves superior results compared to GutenTag, a popular tag generation algorithm.
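
To make the filtration idea concrete, here is a minimal sketch of how a sequence tag can prune a protein database before any expensive spectrum scoring. It is an illustration under stated assumptions, not the authors' implementation: the function name `tag_filter`, the tolerance `tol`, and the representation of a tag as a short amino-acid string plus a prefix mass are all hypothetical, and the probability model for tag accuracy described in the abstract is not shown. The sketch keeps only database positions where the tag string occurs and where the residues immediately before it sum to the tag's prefix mass.

```python
# Monoisotopic residue masses (Da) of the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def tag_filter(proteins, tag, prefix_mass, tol=0.5):
    """Return (protein_index, peptide_start, tag_start) candidates.

    Hypothetical helper: a candidate is kept when `tag` occurs in the
    protein and the residues between `peptide_start` and `tag_start`
    sum to `prefix_mass` within `tol` Da.
    """
    hits = []
    for p_idx, protein in enumerate(proteins):
        pos = protein.find(tag)
        while pos != -1:
            mass = 0.0
            start = pos
            # Extend the candidate peptide leftwards one residue at a time.
            while True:
                if abs(mass - prefix_mass) <= tol:
                    hits.append((p_idx, start, pos))
                    break
                if mass > prefix_mass + tol or start == 0:
                    break  # overshot the prefix mass or ran out of sequence
                start -= 1
                # Non-standard letters (e.g. X) get infinite mass and end the walk.
                mass += RESIDUE_MASS.get(protein[start], float("inf"))
            pos = protein.find(tag, pos + 1)
    return hits


if __name__ == "__main__":
    database = ["MKTAYIAKQR", "GGSAYIAW"]
    # Tag "AYI" preceded by residues K and T (128.09496 + 101.04768 Da).
    print(tag_filter(database, "AYI", 229.14264))
```

Only the surviving candidates would then be passed to a full spectrum-versus-peptide scoring step. A practical filter would also check the suffix mass against the parent mass, tolerate modifications, and index the database (for example with a trie of tags) instead of rescanning it for every tag.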
