Analysis of the Quality and Utility of Random Shotgun Sequencing at Low Redundancies

Authors: John Bouck, Webb Miller, James H. Gorrell, Donna Muzny, Richard A. Gibbs

DOI: 10.1101/GR.8.10.1074

Abstract: The currently favored approach for sequencing the human genome involves selecting representative large-insert clones (100–200 kb), randomly shearing this DNA to construct shotgun libraries, and then sequencing many different isolates from the library. This method, entitled directed random sequencing, requires highly redundant sequencing to obtain a complete and accurate finished consensus sequence. Recently it has been suggested that a rapidly generated lower redundancy sequence might be of use to the scientific community. Low-redundancy sequencing has been examined previously using simulated data sets. Here we utilize trace data from a number of projects submitted to GenBank to perform reconstruction experiments that mimic low-redundancy sequencing. These low-redundancy sequences have been examined for the completeness and quality of the product, information content, and usefulness for interspecies comparisons. The data presented here suggest three sequencing strategies, each with different utilities. (1) Nearly complete sequence data can be obtained by sequencing a random shotgun library at sixfold redundancy. This may therefore represent a good point to switch to a directed approach. (2) Sequencing can be performed with as little as twofold redundancy to find most of the information about exons, EST hits, and putative exon similarity matches. (3) To obtain contiguity of coding regions, sequencing at three- to fourfold redundancy would be appropriate. From these results, we suggest that a useful intermediate product for genome sequencing might be obtained at three- to fourfold redundancy. Such a product would allow a large amount of biologically useful data to be extracted while postponing the majority of the work involved in producing a high-quality consensus sequence.

References (24)
J. C. Venter, GENOMICS: Shotgun Sequencing of the Human Genome. Science, vol. 280, pp. 1540–1542 (1998), 10.1126/SCIENCE.280.5369.1540
Anthony Favello, LaDeana Hillier, Richard K. Wilson, Genomic DNA sequencing methods. Methods in Cell Biology, vol. 48, pp. 551–569 (1995), 10.1016/S0091-679X(08)61403-X
Laurie Goodman, Random Shotgun Fire. Genome Research, vol. 8, pp. 567–568 (1998), 10.1101/GR.8.6.567
Ross C. Hardison, John Oeltjen, Webb Miller, Long Human–Mouse Sequence Alignments Reveal Novel Regulatory Elements: A Reason to Sequence the Mouse Genome. Genome Research, vol. 7, pp. 959–966 (1997), 10.1101/GR.7.10.959
Maynard Olson, Phil Green, A “Quality-First” Credo for the Human Genome Project. Genome Research, vol. 8, pp. 414–415 (1998), 10.1101/GR.8.5.414
Eric S. Lander, Michael S. Waterman, Genomic mapping by fingerprinting random clones: A mathematical analysis. Genomics, vol. 2, pp. 231–239 (1988), 10.1016/0888-7543(88)90007-9
Geoffrey D. Smith, Kenneth E. Bernstein, BULLET: a computer simulation of shotgun DNA sequencing. Bioinformatics, vol. 11, pp. 155–157 (1995), 10.1093/BIOINFORMATICS/11.2.155
Stephanie L. Chissoe, Marco A. Marra, LaDeana Hillier, Ryan Brinkman, Richard K. Wilson, Robert H. Waterston, Representation of cloned genomic sequences in two sequencing vectors: correlation of DNA sequence and subclone distribution. Nucleic Acids Research, vol. 25, pp. 2960–2966 (1997), 10.1093/NAR/25.15.2960
Jared C. Roach, Cecilie Boysen, Kai Wang, Leroy Hood, Pairwise end sequencing: a unified approach to genomic mapping and sequencing. Genomics, vol. 26, pp. 345–353 (1995), 10.1016/0888-7543(95)80219-C