Reference-free compression of high throughput sequencing data with a probabilistic de Bruijn graph.

作者: Gaëtan Benoit , Claire Lemaitre , Dominique Lavenier , Erwan Drezen , Thibault Dayris

DOI: 10.1186/S12859-015-0709-7

关键词:

摘要: … Other lossy quality scores compression approaches use the … of the quality scores by smoothing unimportant quality values, in … However, read-reordering strategy is acceptable in our …

参考文章(33)
Y. William Yu, Deniz Yorukoglu, Bonnie Berger, Traversing the k-mer Landscape of NGS Read Datasets for Quality Score Sparsification research in computational molecular biology. ,vol. 8394, pp. 385- 399 ,(2014) , 10.1007/978-3-319-05269-4_31
Zamin Iqbal, Mario Caccamo, Isaac Turner, Paul Flicek, Gil McVean, De novo assembly and genotyping of variants using colored de Bruijn graphs Nature Genetics. ,vol. 44, pp. 226- 232 ,(2012) , 10.1038/NG.1028
G. Rizk, D. Lavenier, R. Chikhi, DSK: K-Mer Counting With Very Low Memory Usage Bioinformatics. ,vol. 29, pp. 652- 653 ,(2013) , 10.1093/BIOINFORMATICS/BTT020
Y William Yu, Deniz Yorukoglu, Jian Peng, Bonnie Berger, Quality score compression improves genotyping accuracy Nature Biotechnology. ,vol. 33, pp. 240- 243 ,(2015) , 10.1038/NBT.3170
James K. Bonfield, Matthew V. Mahoney, Compression of FASTQ and SAM format sequencing data. PLOS ONE. ,vol. 8, ,(2013) , 10.1371/JOURNAL.PONE.0059190
Carl Kingsford, Rob Patro, Reference-based compression of short-read sequences using path encoding Bioinformatics. ,vol. 31, pp. 1920- 1928 ,(2015) , 10.1093/BIOINFORMATICS/BTV071
H. Li, R. Durbin, Fast and accurate short read alignment with Burrows–Wheeler transform Bioinformatics. ,vol. 25, pp. 1754- 1760 ,(2009) , 10.1093/BIOINFORMATICS/BTP324
R. Chikhi, P. Medvedev, Informed and automated k-mer size selection for genome assembly Bioinformatics. ,vol. 30, pp. 31- 37 ,(2014) , 10.1093/BIOINFORMATICS/BTT310
Rayan Chikhi, Guillaume Rizk, Space-efficient and exact de Bruijn graph representation based on a Bloom filter Algorithms for Molecular Biology. ,vol. 8, pp. 22- 22 ,(2013) , 10.1186/1748-7188-8-22
H. Li, B. Handsaker, A. Wysoker, T. Fennell, J. Ruan, N. Homer, G. Marth, G. Abecasis, R. Durbin, , The Sequence Alignment/Map format and SAMtools Bioinformatics. ,vol. 25, pp. 2078- 2079 ,(2009) , 10.1093/BIOINFORMATICS/BTP352