Whole-Genome Sequence Assembly for Mammalian Genomes: Arachne 2

Authors: David B Jaffe, Jonathan Butler, Sante Gnerre, Evan Mauceli, Kerstin Lindblad-Toh

DOI: 10.1101/GR.828403

Abstract: We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal changes were simultaneously made and applied to the assembly of the mouse genome, during a six-month period of development: (1) Supercontigs (scaffolds) were iteratively broken and rejoined using several criteria, yielding a 64-fold increase in length (N50), and apparent elimination of all global misjoins; (2) gaps between contigs in supercontigs were filled (partially or completely) by insertion of reads, as suggested by pairing within the supercontig, increasing the N50 contig length by 50%; (3) memory usage was reduced fourfold. The outcome of this mouse assembly and its analysis are described in (Mouse Genome Sequencing Consortium 2002).
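
The abstract reports its improvements in terms of N50 length. As a rough illustration only, the sketch below computes N50 under the commonly used definition: the length L such that contigs (or supercontigs) of length at least L contain at least half of the total assembled bases. The function name `n50` and the sample lengths are hypothetical, and this record does not state the exact convention used by Arachne.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or supercontig lengths.

    Assumed definition (not taken from the Arachne paper): the largest
    length L such that pieces of length >= L cover at least half of the
    total bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("n50 requires at least one length")


# Made-up lengths: five small pieces versus the same bases in two pieces.
fragmented = [2, 3, 4, 5, 6]
rejoined = [5, 15]
print(n50(fragmented), n50(rejoined))
```

Both lists hold the same 20 bases, so the change in N50 comes only from how the bases are grouped, which is why joining supercontigs or filling gaps between contigs raises the statistic.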
