Genome wide analysis of Arabidopsis core promoters

作者: Carlos Molina , Erich Grotewold

DOI: 10.1186/1471-2164-6-25

关键词:

摘要: Core promoters are the gene regulatory regions most proximal to transcription start site (TSS), central formation of pre-initiation complexes and for combinatorial regulation. The DNA elements required core promoter function in plants poorly understood. To establish sequence motifs that characterize plant compare them corresponding sequences animals, we took advantage available full-length cDNAs (FL-cDNAs) predicted upstream carry out analysis 12,749 Arabidopsis promoters. Using a combination expectation maximization Gibbs sampling methods, identified several overrepresented One corresponded TATA element, which an in-depth resulted generation robust Nucleotide Frequency Matrices (NFMs) capable predicting with high degree confidence. We established approximately 29% all contain motifs, clustered around position -32 respect TSS. presence was associated genes represented more frequently EST collections shorter 5' UTRs. No cis-elements were found over-represented TATA-less, compared TATA-containing Our studies provide first genome-wide illustration composition structure percentage is much lower than commonly recognized, yet comparable number Drosophila containing element. Although other as promoters, they present only small fraction represent not previously described suggesting distinct architecture animal genes.

参考文章(32)
Charles Elkan, Timothy L. Bailey, The value of prior knowledge in discovering motifs with MEME. intelligent systems in molecular biology. ,vol. 3, pp. 21- 29 ,(1995)
Rajendra R. Joshi, Chintalapati Janaki, Motif detection in Arabidopsis: correlation with gene expression data. in Silico Biology. ,vol. 4, pp. 149- 161 ,(2004)
Uwe Ohler, Heinrich Niemann, Identification and analysis of eukaryotic promoters: recent computational approaches. Trends in Genetics. ,vol. 17, pp. 56- 60 ,(2001) , 10.1016/S0168-9525(00)02174-0
R Mantovani, A survey of 178 NF-Y binding CCAAT boxes Nucleic Acids Research. ,vol. 26, pp. 1135- 1143 ,(1998) , 10.1093/NAR/26.5.1135
T. Bilaud, C. E. Koering, E. Binet-Brasselet, K. Ancelin, A. Pollice, S. M. Gasser, E. Gilson, The Telobox, a Myb-Related Telomeric DNA Binding Motif Found in Proteins from Yeast, Plants and Human Nucleic Acids Research. ,vol. 24, pp. 1294- 1303 ,(1996) , 10.1093/NAR/24.7.1294
Dominique Tremousaygue, Alexandra Manevski, Claude Bardet, Nicole Lescure, Bernard Lescure, Plant interstitial telomere motifs participate in the control of gene expression in root meristems. Plant Journal. ,vol. 20, pp. 553- 561 ,(1999) , 10.1046/J.1365-313X.1999.00627.X
Motoaki Seki, Mari Narusaka, Kazuko Yamaguchi-Shinozaki, Piero Carninci, Jun Kawai, Yoshihide Hayashizaki, Kazuo Shinozaki, Arabidopsis encyclopedia using full-length cDNAs and its application Plant Physiology and Biochemistry. ,vol. 39, pp. 211- 220 ,(2001) , 10.1016/S0981-9428(01)01244-X
Dominique Trémousaygue, Lionel Garnier, Claude Bardet, Patrick Dabos, Christine Hervé, Bernard Lescure, Internal telomeric repeats and 'TCP domain' protein-binding sites co-operate to regulate gene expression in Arabidopsis thaliana cycling cells Plant Journal. ,vol. 33, pp. 957- 966 ,(2003) , 10.1046/J.1365-313X.2003.01682.X
Eric J. Richards, Frederick M. Ausubel, Isolation of a higher eukaryotic telomere from Arabidopsis thaliana Cell. ,vol. 53, pp. 127- 136 ,(1988) , 10.1016/0092-8674(88)90494-1