Deciphering cis-regulatory logic with 100 million random promoters

Authors: Ronen Sadeh, Nir Friedman, Carl G. de Boer, Aviv Regev

DOI: 10.1101/224907

Keywords: Chromatin, DNA, Computational biology, Transcriptional regulation, Biology, Gene expression, Regulatory sequence, Transcription factor, Promoter, Genomics

Abstract: Predicting how transcription factors (TFs) interpret regulatory sequences to control gene expression remains a major challenge. Past studies have primarily focused on native or engineered sequences, and thus remained limited in scale. Here, we use random sequences as an alternative, measuring the expression output of over 100 million synthetic yeast promoters comprised of random DNA. Random sequences yield a broad range of reproducible expression levels, indicating that the fortuitous binding sites in random DNA are functional. From these data we learn models of transcriptional regulation that explain over 94% of the expression variation in test data, recapitulate the organization of native chromatin in yeast, characterize the activity of TFs, and help refine cis-regulatory motifs. We find that strand, position, and helical face preferences of TFs are widespread and depend on interactions with neighboring chromatin. Such high-throughput regulatory assays of random DNA provide the large-scale data necessary to learn complex models of cis-regulatory logic.
