Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

作者: Deanna M Church , Leo Goodstadt , LaDeana W Hillier , Michael C Zody , Steve Goldstein

DOI: 10.1371/JOURNAL.PBIO.1000112

关键词: Sequence assemblyBiologyComparative genomicsGeneWhole genome sequencingPopulationGene familyHuman genomeGeneticsGenomeComputational biology

摘要: The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive of biology only possible with availability finished, high-quality genome assembly. finished clone-based assembly strain C57BL/6J reported here has over 175,000 fewer gaps 139 Mb more novel sequence, compared earlier MGSCv3 draft In analysis this revised are now able to define 20,210 protein-coding genes, thousand than predicted in (19,042 genes). addition, identified 439 long, non–protein-coding RNAs evidence transcribed orthologs human. We analyzed complex repetitive landscape 267 sequence was missing or misassembled previously published assembly, provide insights into reasons its resistance sequencing by whole-genome shotgun approaches. Duplicated regions within newly assembled tend be recent ancestry duplicates draft, correcting our initial evolution on lineage. These appear largely composed containing transposable elements duplicated genes; these, some may fixed population, but at least 40% segmentally sequences copy number variable even among laboratory strains. Mouse lineage-specific contain 3,767 genes drawn mainly from rapidly-changing gene families associated reproductive functions. therefore, greatly improves rodent-specific allows delineation ancestral biological functions shared derived not.

参考文章(80)
Hui Huang, Eitan E Winter, Huajun Wang, Keith G Weinstock, Heming Xing, Leo Goodstadt, Peter D Stenson, David N Cooper, Douglas Smith, M Mar Albà, Chris P Ponting, Kim Fechtel, Evolutionary conservation and selection of human disease gene orthologs in the rat and mouse genomes. Genome Biology. ,vol. 5, pp. 1- 15 ,(2004) , 10.1186/GB-2004-5-7-R47
Emma Whitelaw, David I.K. Martin, Retrotransposons as epigenetic mediators of phenotypic variation in mammals. Nature Genetics. ,vol. 27, pp. 361- 365 ,(2001) , 10.1038/86850
William R. Downs, Louis L. Jacobs, THE EVOLUTION OF MURINE RODENTS IN ASIA National Science Museum monographs. ,vol. 8, pp. 149- 156 ,(1994)
Thomas J. Hudson, Deanna M. Church, Simon Greenaway, Huy Nguyen, April Cook, Robert G. Steen, William J. Van Etten, Andrew B. Castle, Mark A. Strivens, Pamela Trickett, Christine Heuston, Claire Davison, Anne Southwell, Rachel Hardisty, Anabel Varela-Carver, Andrew R. Haynes, Patricia Rodriguez-Tome, Hirofumi Doi, Minoru S.H. Ko, Joan Pontius, Lynn Schriml, Lukas Wagner, Donna Maglott, Steve D.M. Brown, Eric S. Lander, Greg Schuler, Paul Denny, A radiation hybrid map of mouse genes. Nature Genetics. ,vol. 29, pp. 201- 205 ,(2001) , 10.1038/NG1001-201
Monica J. Justice, Capitalizing on large-scale mouse mutagenesis screens. Nature Reviews Genetics. ,vol. 1, pp. 109- 115 ,(2000) , 10.1038/35038549
Chimpanzee Sequencing and Analysis Consortium Waterson Robert H. waterston@ gs. washington. edu Lander Eric S. lander@ broad. mit. edu Wilson Richard K. rwilson@ watson. wustl. edu, None, Initial sequence of the chimpanzee genome and comparison with the human genome Nature. ,vol. 437, pp. 69- 87 ,(2005) , 10.1038/NATURE04072
Zoë Birtle, Leo Goodstadt, Chris Ponting, Duplication and positive selection among hominin-specific PRAME genes BMC Genomics. ,vol. 6, pp. 120- 120 ,(2005) , 10.1186/1471-2164-6-120
Jeffrey A. Bailey, Evan E. Eichler, Primate segmental duplications: crucibles of evolution, diversity and disease Nature Reviews Genetics. ,vol. 7, pp. 552- 564 ,(2006) , 10.1038/NRG1895
Sébastien Dadé, Isabelle Callebaut, Pascal Mermillod, Philippe Monget, Identification of a new expanding family of genes characterized by atypical LRR domains. Localization of a cluster preferentially expressed in oocyte. FEBS Letters. ,vol. 555, pp. 533- 538 ,(2003) , 10.1016/S0014-5793(03)01341-3