Light-weight reference-based compression of FASTQ data

Authors: Yongpeng Zhang, Linsen Li, Yanli Yang, Xiao Yang, Shan He

DOI: 10.1186/S12859-015-0628-7

Abstract: … compression algorithm namely LW-FQZip to compress … , metadata, short reads and quality score strings, are first parsed … are utilized to compress the metadata and quality score streams, …
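
The parsing step mentioned in the abstract (separating metadata, short reads and quality score strings) can be illustrated with a minimal sketch. This is not the LW-FQZip implementation; the function name `split_fastq_streams` is hypothetical, and the code assumes the common four-line FASTQ record layout (header starting with `@`, sequence, separator starting with `+`, quality string). It only separates the three streams and does not compress them.

```python
from typing import Iterable, List, Tuple


def split_fastq_streams(lines: Iterable[str]) -> Tuple[List[str], List[str], List[str]]:
    """Split four-line FASTQ records into metadata, read, and quality streams.

    Hypothetical helper for illustration only; assumes each record is exactly
    four lines: '@header', sequence, '+...', quality.
    """
    metadata: List[str] = []
    reads: List[str] = []
    qualities: List[str] = []
    record: List[str] = []

    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line and not record:
            continue  # ignore blank lines between records
        record.append(line)
        if len(record) == 4:
            header, sequence, separator, quality = record
            if not header.startswith("@") or not separator.startswith("+"):
                raise ValueError(f"malformed FASTQ record: {record!r}")
            if len(sequence) != len(quality):
                raise ValueError("sequence and quality lengths differ")
            metadata.append(header[1:])  # drop the leading '@'
            reads.append(sequence)
            qualities.append(quality)
            record = []

    if record:
        raise ValueError("truncated FASTQ record at end of input")
    return metadata, reads, qualities


if __name__ == "__main__":
    sample = ["@read1 lane=1\n", "ACGT\n", "+\n", "IIII\n"]
    meta, seqs, quals = split_fastq_streams(sample)
    print(meta, seqs, quals)
```

Keeping the three streams separate lets each one be handed to a compressor suited to its statistics, which is the general motivation the abstract points to.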

References (31)
Yongpeng Zhang, Linsen Li, Jun Xiao, Yanli Yang, Zexuan Zhu. FQZip: Lossless Reference-Based Compression of Next Generation Sequencing Data in FASTQ Format. Springer, Cham, pp. 127-135 (2015). DOI: 10.1007/978-3-319-13356-0_11
Ben Langmead. Aligning Short Sequencing Reads with Bowtie. Current Protocols in Human Genetics, vol. 32 (2010). DOI: 10.1002/0471250953.BI1107S32
Mark Howison. High-Throughput Compression of FASTQ Data with SeqDB. IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, pp. 213-218 (2013). DOI: 10.1109/TCBB.2012.160
Niko Popitsch, Arndt von Haeseler. NGC: lossless and lossy compression of aligned high-throughput sequencing data. Nucleic Acids Research, vol. 41 (2013). DOI: 10.1093/NAR/GKS939
Idoia Ochoa, Himanshu Asnani, Dinesh Bharadia, Mainak Chowdhury, Tsachy Weissman, Golan Yona. QualComp: a new lossy compressor for quality scores based on rate distortion theory. BMC Bioinformatics, vol. 14, p. 187 (2013). DOI: 10.1186/1471-2105-14-187
Jiarui Zhou, Zhen Ji, Zexuan Zhu, Shan He. Compression of next-generation sequencing quality scores using memetic algorithm. BMC Bioinformatics, vol. 15, pp. 1-7 (2014). DOI: 10.1186/1471-2105-15-S15-S10
R. Giancarlo, S. E. Rombo, F. Utro. Compressive biological sequence analysis and archival in the era of high-throughput sequencing technologies. Briefings in Bioinformatics, vol. 15, pp. 390-406 (2014). DOI: 10.1093/BIB/BBT088
Christos Kozanitis, Chris Saunders, Semyon Kruglyak, Vineet Bafna, George Varghese. Compressing genomic sequence fragments using SlimGene. Journal of Computational Biology, vol. 18, pp. 401-413 (2011). DOI: 10.1089/CMB.2010.0253
Erwin L. van Dijk, Hélène Auger, Yan Jaszczyszyn, Claude Thermes. Ten years of next-generation sequencing technology. Trends in Genetics, vol. 30, pp. 418-426 (2014). DOI: 10.1016/J.TIG.2014.07.001