MzJava: An open source library for mass spectrometry data processing.

Authors: Oliver Horlacher, Frederic Nikitin, Davide Alocci, Julien Mariethoz, Markus Müller

DOI: 10.1016/J.JPROT.2015.06.013

Abstract: Mass spectrometry (MS) is a widely used and evolving technique for the high-throughput identification of molecules in biological samples. The need for sharing and reuse of code among bioinformaticians working with MS data prompted the design and implementation of MzJava, an open-source Java Application Programming Interface (API) for MS-related data processing. MzJava provides data structures and algorithms for representing and processing mass spectra and their associated biological molecules, such as metabolites, glycans and peptides. MzJava includes functionality to perform mass calculation, peak processing (e.g. centroiding, filtering, transforming), spectrum alignment and clustering, protein digestion, and fragmentation of peptides and glycans, as well as scoring functions for spectrum–spectrum and peptide/glycan-spectrum matches. For data import and export, MzJava implements readers and writers for commonly used data formats. For many classes, support for the Hadoop MapReduce (hadoop.apache.org) and Apache Spark (spark.apache.org) frameworks for cluster computing was implemented. The library has been developed applying best practices of software engineering. To ensure that MzJava contains code that is correct and easy to use, the library's API was carefully designed and thoroughly tested. MzJava is an open-source project distributed under the AGPL v3.0 licence. MzJava requires Java 1.7 or higher. Binaries, source code and documentation can be downloaded from http://mzjava.expasy.org and https://bitbucket.org/sib-pig/mzjava. This article is part of a Special Issue entitled: Computational Proteomics.
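To make two of the operations named in the abstract concrete (protein digestion and mass calculation), here is a minimal sketch in plain Java. It does not use MzJava's API: the class `PeptideSketch` and its methods are hypothetical names introduced only for illustration. It assumes the commonly used trypsin rule (cleave after K or R unless the next residue is P), no missed cleavages, unmodified residues, and standard monoisotopic residue masses rounded to five decimals.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration class; not part of MzJava.
class PeptideSketch {
    // Monoisotopic masses in Da of the 20 standard amino acid residues.
    private static final Map<Character, Double> RESIDUE_MASS = new HashMap<>();
    static {
        String residues = "GASPVTCLINDQKEMHFRYW";
        double[] masses = {
            57.02146, 71.03711, 87.03203, 97.05276, 99.06841,
            101.04768, 103.00919, 113.08406, 113.08406, 114.04293,
            115.02694, 128.05858, 128.09496, 129.04259, 131.04049,
            137.05891, 147.06841, 156.10111, 163.06333, 186.07931
        };
        for (int i = 0; i < masses.length; i++) {
            RESIDUE_MASS.put(residues.charAt(i), masses[i]);
        }
    }

    static final double WATER = 18.01056;  // H2O added for the free termini
    static final double PROTON = 1.00728;  // mass of one proton

    // In-silico tryptic digestion: cleave after K or R unless followed by P.
    // No missed cleavages are generated.
    static List<String> digestTrypsin(String protein) {
        List<String> peptides = new ArrayList<>();
        int start = 0;
        for (int i = 0; i < protein.length(); i++) {
            char c = protein.charAt(i);
            boolean lastResidue = i == protein.length() - 1;
            boolean cleave = (c == 'K' || c == 'R')
                    && !lastResidue
                    && protein.charAt(i + 1) != 'P';
            if (cleave || lastResidue) {
                peptides.add(protein.substring(start, i + 1));
                start = i + 1;
            }
        }
        return peptides;
    }

    // Neutral monoisotopic mass: sum of residue masses plus one water.
    static double monoisotopicMass(String peptide) {
        double mass = WATER;
        for (int i = 0; i < peptide.length(); i++) {
            Double residue = RESIDUE_MASS.get(peptide.charAt(i));
            if (residue == null) {
                throw new IllegalArgumentException(
                        "Unknown residue: " + peptide.charAt(i));
            }
            mass += residue;
        }
        return mass;
    }

    // m/z of the peptide carrying `charge` protons (charge must be >= 1).
    static double mz(String peptide, int charge) {
        if (charge < 1) {
            throw new IllegalArgumentException("charge must be >= 1");
        }
        return (monoisotopicMass(peptide) + charge * PROTON) / charge;
    }

    public static void main(String[] args) {
        // Print each tryptic peptide with its neutral mass and 2+ m/z.
        for (String peptide : digestTrypsin("AKRPGKM")) {
            System.out.println(peptide + "\t" + monoisotopicMass(peptide)
                    + "\t" + mz(peptide, 2));
        }
    }
}
```

A library such as MzJava would typically add modifications, missed cleavages, other proteases and fragment-ion generation on top of this; consult the project documentation for its actual classes.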
