Protein-dna binding: discovering motifs and distinguishing direct from indirect interactions

作者: Alexander J. Hartemink , Raluca M. Gordan

DOI:

关键词:

摘要: The initiation of two major processes in the eukaryotic cell, gene transcription and DNA replication, is regulated largely through interactions between proteins or protein complexes DNA. Although a lot known about interacting their role regulating specific binding motifs many regulatory are still to be determined. For this purpose, computational tools for motif discovery have been developed last decades. These employ variety strategies, from exhaustive search sampling techniques, with hope finding over-represented sets co-regulated co-bound sequences. Despite aimed at solving problem discovery, ability correctly detect limited. usually short times degenerate, which makes them difficult distinguish genomic background. We believe most efficient strategy improving performance not use increasingly complex statistical methods models, but incorporate more biology into principled manner. To end, we propose novel algorithm: PRIORITY. Based on general Gibbs framework, PRIORITY has advantage over other tools: it can different types biological information (e.g., nucleosome positioning information) guide sites toward regions where these likely occur nucleosome-free regions). We factor (TF) data yeast chromatin immunoprecipitation (ChIP-chip) experiments test our algorithm when incorporating three information: positioning, double-helical stability, evolutionary conservation information. In each case, additional proven very useful increasing accuracy finding, number identified up 52%. restricted TF data. work, also analyze origin recognition (ORC) show that utilize structural predict specificity ORC. Despite improvement obtained using information, success algorithms identifying limited, especially applied sequences bound vivo (such as those ChIP-chip) because observed protein-DNA necessarily direct. Some TFs associate only indirectly via partners, while others exhibit both direct indirect binding. method TF-DNA interactions, integrating data, occupancy vitro microarrays. When ChIP-chip reveals 48% readily explained by profiled TF, 16% remaining 36%, found none used analysis was able explain ChIPchip either too noisy set incomplete. As become available, build complete catalog interactions.

参考文章(146)
Bing Ren, François Robert, John J Wyrick, Oscar Aparicio, Ezra G Jennings, Itamar Simon, Julia Zeitlinger, Jorg Schreiber, Nancy Hannett, Elenita Kanin, Thomas L Volkert, Christopher J Wilson, Stephen P Bell, Richard A Young, Genome-Wide Location and Function of DNA Binding Proteins Science. ,vol. 290, pp. 2306- 2309 ,(2000) , 10.1126/SCIENCE.290.5500.2306
Weihong Xu, Jennifer G Aparicio, Oscar M Aparicio, Simon Tavaré, Genome-wide mapping of ORC and Mcm2p binding sites on tiling arrays and identification of essential ARS consensus sequences in S. cerevisiae. BMC Genomics. ,vol. 7, pp. 276- 276 ,(2006) , 10.1186/1471-2164-7-276
X. Shirley Liu, Douglas L. Brutlag, Jun S. Liu, An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments. Nature Biotechnology. ,vol. 20, pp. 835- 839 ,(2002) , 10.1038/NBT717
J.Russell Lipford, Stephen P Bell, Nucleosomes positioned by ORC facilitate the initiation of DNA replication. Molecular Cell. ,vol. 7, pp. 21- 30 ,(2001) , 10.1016/S1097-2765(01)00151-4
Tong Ihn Lee, Nicola J Rinaldi, François Robert, Duncan T Odom, Ziv Bar-Joseph, Georg K Gerber, Nancy M Hannett, Christopher T Harbison, Craig M Thompson, Itamar Simon, Julia Zeitlinger, Ezra G Jennings, Heather L Murray, D Benjamin Gordon, Bing Ren, John J Wyrick, Jean-Bosco Tagne, Thomas L Volkert, Ernest Fraenkel, David K Gifford, Richard A Young, Transcriptional Regulatory Networks in Saccharomyces cerevisiae Science. ,vol. 298, pp. 799- 804 ,(2002) , 10.1126/SCIENCE.1075090
W. J. Kent, C. W. Sugnet, T. S. Furey, K. M. Roskin, T. H. Pringle, A. M. Zahler, a. D. Haussler, The Human Genome Browser at UCSC Genome Research. ,vol. 12, pp. 996- 1006 ,(2002) , 10.1101/GR.229102
D. S. Johnson, A. Mortazavi, R. M. Myers, B. Wold, Genome-Wide Mapping of in Vivo Protein-DNA Interactions Science. ,vol. 316, pp. 1497- 1502 ,(2007) , 10.1126/SCIENCE.1141319
Leelavati Narlikar, Raluca Gordân, Alexander J. Hartemink, Nucleosome occupancy information improves de novo motif discovery research in computational molecular biology. pp. 107- 121 ,(2007) , 10.1007/978-3-540-71681-5_8
Bing Li, Michael Carey, Jerry L. Workman, The Role of Chromatin during Transcription Cell. ,vol. 128, pp. 707- 719 ,(2007) , 10.1016/J.CELL.2007.01.015