Sequencing of natural strains of Arabidopsis thaliana with short reads

Authors: S. Ossowski, K. Schneeberger, R. M. Clark, C. Lanz, N. Warthmann

DOI: 10.1101/GR.080200.108

Abstract: Whole-genome hybridization studies have suggested that the nuclear genomes of accessions (natural strains) of Arabidopsis thaliana can differ by several percent of their sequence. To examine this variation, and as a first step in the 1001 Genomes Project for this species, we produced 15- to 25-fold coverage in Illumina sequencing-by-synthesis (SBS) reads for the reference accession, Col-0, and two divergent strains, Bur-0 and Tsu-1. We aligned reads to the reference genome sequence to assess data quality metrics and to detect polymorphisms. Alignments revealed 823,325 unique single nucleotide polymorphisms (SNPs) and 79,961 unique 1- to 3-bp indels at a specificity of >99%, and over 2000 potential errors in the reference genome sequence. We also identified >3.4 Mb of the Bur-0 and Tsu-1 genomes as being either extremely dissimilar, deleted, or duplicated relative to the reference genome. To obtain sequences for these regions, we incorporated the Velvet assembler into a targeted de novo assembly method. This approach yielded 10,921 high-confidence contigs that were anchored to flanking sequences and harbored indels as large as 641 bp. Our methods are broadly applicable for polymorphism discovery in moderate to large genomes even at highly diverged loci, and we established by subsampling the Illumina SBS coverage depth required to inform a broad range of functional and evolutionary studies. Our pipeline for aligning reads and predicting SNPs and indels, SHORE, is available for download at http://1001genomes.org.
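
The abstract mentions subsampling to find the sequencing depth a study needs. As a rough illustration only, the Python sketch below shows the arithmetic behind fold coverage and one simple way to thin a read set to a lower target coverage. It is not taken from SHORE or from the paper's methods; the function names, the per-read random thinning, and the seed parameter are all assumptions made for this example.

```python
import random


def fold_coverage(n_reads, read_len, genome_size):
    """Average fold coverage = total sequenced bases / genome length."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return n_reads * read_len / genome_size


def subsample_fraction(current_cov, target_cov):
    """Fraction of reads to keep to go from current_cov to target_cov."""
    if current_cov <= 0 or target_cov <= 0:
        raise ValueError("coverage values must be positive")
    # Never keep more than all reads, even if the target exceeds the input.
    return min(1.0, target_cov / current_cov)


def subsample_reads(reads, fraction, seed=0):
    """Keep each read independently with probability `fraction`.

    Hypothetical helper: a seeded generator makes the draw reproducible.
    The realized coverage only approximates the target.
    """
    rng = random.Random(seed)
    return [read for read in reads if rng.random() < fraction]


if __name__ == "__main__":
    # Thinning a 25-fold data set to 15-fold (the range given in the abstract).
    keep = subsample_fraction(25, 15)
    toy_reads = [f"read_{i}" for i in range(1000)]
    kept = subsample_reads(toy_reads, keep, seed=1)
    print(f"keep fraction: {keep:.2f}, reads kept: {len(kept)} of {len(toy_reads)}")
```

Repeating the thinning at several target coverages and re-running variant detection on each subset is one way to see how many calls are retained at each depth.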
