NGSANE: A Lightweight Production Informatics Framework for High Throughput Data Analysis

Authors: Fabian A. Buske, Hugh J. French, Martin A. Smith, Susan J. Clark, Denis C. Bauer

DOI: 10.1093/BIOINFORMATICS/BTU036

Abstract: Summary: The initial steps in the analysis of next-generation sequencing data can be automated by way of software 'pipelines'. However, individual components depreciate rapidly because of evolving technology and analysis methods, often rendering entire versions of production informatics pipelines obsolete. Constructing pipelines from Linux bash commands enables the use of hot-swappable modular components, as opposed to the more rigid program-call wrapping by higher-level languages implemented in comparable published pipelining systems. Here we present Next Generation Sequencing ANalysis for Enterprises (NGSANE), a Linux-based, high-performance-computing-enabled framework that minimizes the overhead of setting up and processing new projects, yet maintains the full flexibility of custom scripting when processing raw sequence data. Availability and implementation: NGSANE is publicly available under the BSD (3-Clause) licence via GitHub at https://github.com/BauerLab/ngsane. Contact: Denis.Bauer@csiro.au Supplementary information: Supplementary data are available at Bioinformatics online.
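
The idea of hot-swappable modules built from bash commands can be illustrated with a minimal sketch. This is not NGSANE's actual code or interface: the function names (`trim_with_sed`, `trim_with_awk`, `filter_short`, `run_pipeline`), the `MIN_LEN` variable, and the adapter string `AGAT` are all hypothetical and chosen only for illustration. The sketch assumes each stage reads from stdin and writes to stdout, so that one implementation of a stage can be exchanged for another without touching the driver.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Two interchangeable implementations of a hypothetical "trim" stage.
# Both strip a trailing adapter (the made-up string AGAT) from each read.
trim_with_sed() { sed 's/AGAT$//'; }
trim_with_awk() { awk '{ sub(/AGAT$/, ""); print }'; }

# A second hypothetical stage: keep only reads of at least MIN_LEN bases.
MIN_LEN=${MIN_LEN:-4}
filter_short() { awk -v n="$MIN_LEN" 'length($0) >= n'; }

# Driver: pipe stdin through each named stage, in the order given.
# With no stages left, the data is passed through unchanged.
run_pipeline() {
  if [ "$#" -eq 0 ]; then
    cat
    return
  fi
  local stage=$1
  shift
  "$stage" | run_pipeline "$@"
}
```

Under these assumptions, swapping a component means changing one name in the stage list, for example `printf 'ACGTACAGAT\nACAGAT\nGGGGCCCC\n' | run_pipeline trim_with_awk filter_short` instead of `... run_pipeline trim_with_sed filter_short`. Both calls should produce the same two reads, `ACGTAC` and `GGGGCCCC`, since the second read shrinks to two bases after trimming and is filtered out.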