Sequence-level instructions direct transcription at polyT short tandem repeats

Authors: Michiel J.L. de Hoon, Piero Carninci, Wyeth W. Wasserman, Yoshihide Hayashizaki, Harukazu Suzuki

DOI: 10.1101/634261

Keywords: Biology, Microsatellite, Transcription (biology), Cap analysis gene expression, Promoter, Computational biology, Enhancer, Gene, Transcriptional noise, Repertoire

Abstract: Using the Cap Analysis of Gene Expression technology, the FANTOM5 consortium provided one of the most comprehensive maps of Transcription Start Sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. To determine whether these TSSs, sometimes referred to as ‘transcriptional noise’ or ‘junk’, are relevant nonetheless, we look for novel conserved regulatory motifs located in their vicinity. We show that, in all species studied, a significant fraction of CAGE peaks initiate at short tandem repeats (STRs) corresponding to homopolymers of thymidines. Biochemical and genetic evidence further demonstrate that these CAGEs correspond to TSSs of mostly sense and intronic non-coding RNAs, whose transcription rate can be predicted with 81% accuracy by a sequence-based deep learning model. Excitingly, our model predicts that variants linked to human diseases affect this STR-associated transcription. Together, our results extend the repertoire of non-coding transcription and provide a valuable resource for future studies of complex traits.
