Close sequence comparisons are sufficient to identify human cis-regulatory elements

作者: Shyam Prabhakar , Francis Poulin , Malak Shoukry , Veena Afzal , Edward M Rubin

DOI: 10.1101/GR.4717506

关键词:

摘要: Cross-species DNA sequence comparison is the primary method used to identify functional noncoding elements in human and other large genomes. However, little known about relative merits of evolutionarily close distant comparisons. To address this problem, we identified conserved regions primate, mammalian, more comparisons using a uniform approach (Gumby) that facilitates unbiased assessment impact evolutionary distance on predictive power. We benchmarked computational predictions against previously cis-regulatory at diverse genomic loci also tested numerous extremely human–rodent sequences for transcriptional enhancer activity an vivo assay transgenic mice. Human regulatory were with acceptable sensitivity (53%–80%) true-positive rate (27%–67%) by one five eutherian mammals or six simian primates. More (marsupial, avian, amphibian, fish) failed many empirically defined elements. Our results highlight practical utility comparisons, loss entailed derived intuitive relationship between ancient recent conservation from whole-genome comparative analysis explains most observations empirical benchmarking. Lastly, determined that, addition strength conservation, location and/or density surrounding must be considered selecting candidate enhancers testing embryonic time points.

参考文章(37)
Michael Brudno, Michael Chapman, Berthold Göttgens, Serafim Batzoglou, Burkhard Morgenstern, None, Fast and sensitive multiple alignment of large genomic sequences. BMC Bioinformatics. ,vol. 4, pp. 66- 66 ,(2003) , 10.1186/1471-2105-4-66
Len A. Pennacchio, Edward M. Rubin, Genomic strategies to identify mammalian regulatory sequences Nature Reviews Genetics. ,vol. 2, pp. 100- 109 ,(2001) , 10.1038/35052548
Albin Sandelin, Peter Bailey, Sara Bruce, Pär G Engström, Joanna M Klos, Wyeth W Wasserman, Johan Ericson, Boris Lenhard, Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes BMC Genomics. ,vol. 5, pp. 99- 99 ,(2004) , 10.1186/1471-2164-5-99
S. Brenner, G. Elgar, R. Sanford, A. Macrae, B. Venkatesh, S. Aparicio, Characterization of the pufferfish ( Fugu ) genome as a compact model vertebrate genome Nature. ,vol. 366, pp. 265- 268 ,(1993) , 10.1038/366265A0
K. A. Frazer, L. Pachter, A. Poliakov, E. M. Rubin, I. Dubchak, VISTA: computational tools for comparative genomics Nucleic Acids Research. ,vol. 32, pp. 273- 279 ,(2004) , 10.1093/NAR/GKH458
Richard J. Davis, Weiping Shen, Tiffany A. Heanue, G. Mardon, Mouse Dach, a homologue of Drosophila dachshund, is expressed in the developing retina, brain and limbs. Development Genes and Evolution. ,vol. 209, pp. 526- 536 ,(1999) , 10.1007/S004270050285
S Altschula, Warren Gisha, Webb Millerb, E Meyersc, D Lipmana, None, Basic Local Alignment Search Tool Journal of Molecular Biology. ,vol. 215, pp. 403- 410 ,(1990) , 10.1016/S0022-2836(05)80360-2
Marcelo A Nobrega, Ivan Ovcharenko, Veena Afzal, Edward M Rubin, Scanning Human Gene Deserts for Long-Range Enhancers Science. ,vol. 302, pp. 413- 413 ,(2003) , 10.1126/SCIENCE.1088328
Xavier Caubit, Rajikala Thangarajah, Thomas Theil, Jutta Wirth, Hans-Gerd Nothwang, Ulrich R�ther, Stefan Krauss, Mouse Dac, a novel nuclear factor with homology to Drosophila dachshund shows a dynamic expression in the neural crest, the eye, the neocortex, and the limb bud. Developmental Dynamics. ,vol. 214, pp. 66- 80 ,(1999) , 10.1002/(SICI)1097-0177(199901)214:1<66::AID-DVDY7>3.0.CO;2-7