作者: Amit Kawalia , Susanne Motameny , Stephan Wonczak , Holger Thiele , Lech Nieroda
DOI: 10.1371/JOURNAL.PONE.0126321
关键词:
摘要: Next generation sequencing (NGS) has been a great success and is now standard method of research in the life sciences. With this technology, dozens whole genomes or hundreds exomes can be sequenced rather short time, producing huge amounts data. Complex bioinformatics analyses are required to turn these data into scientific findings. In order run fast, automated workflows implemented on high performance computers state art. While providing sufficient compute power storage meet NGS challenge, computing (HPC) systems require special care when utilized for throughput processing. This especially true if HPC system shared by different users. Here, stability, robustness maintainability as important speed throughput. To achieve all aims, dedicated solutions have developed. paper, we present tricks twists that implementation our exome processing workflow. It may serve guideline other analysis projects using similar infrastructure. The code implementing provided supporting information files.