LRSim: A Linked-Reads Simulator Generating Insights for Better Genome Partitioning.

Authors: Ruibang Luo, Fritz J. Sedlazeck, Charlotte A. Darby, Stephen M. Kelly, Michael C. Schatz

DOI: 10.1016/J.CSBJ.2017.10.002

Abstract: Linked-read sequencing, using highly-multiplexed genome partitioning and barcoding, can span hundreds of kilobases to improve de novo assembly, haplotype phasing, and other applications. Based on our analysis of 14 datasets, we introduce LRSim, which simulates linked-reads by emulating the library preparation and sequencing process with fine control over variants, linked-read characteristics, and the short-read profile. From the phasing and assembly of multiple datasets, we derive recommendations on coverage and fragment length for genomes of different sizes and complexities. These optimizations improve results by orders of magnitude and enable the development of novel methods. LRSim is available at https://github.com/aquaskyline/LRSIM.
