摘要: Core promoters are the gene regulatory regions most proximal to transcription start site (TSS), central formation of pre-initiation complexes and for combinatorial regulation. The DNA elements required core promoter function in plants poorly understood. To establish sequence motifs that characterize plant compare them corresponding sequences animals, we took advantage available full-length cDNAs (FL-cDNAs) predicted upstream carry out analysis 12,749 Arabidopsis promoters. Using a combination expectation maximization Gibbs sampling methods, identified several overrepresented One corresponded TATA element, which an in-depth resulted generation robust Nucleotide Frequency Matrices (NFMs) capable predicting with high degree confidence. We established approximately 29% all contain motifs, clustered around position -32 respect TSS. presence was associated genes represented more frequently EST collections shorter 5' UTRs. No cis-elements were found over-represented TATA-less, compared TATA-containing Our studies provide first genome-wide illustration composition structure percentage is much lower than commonly recognized, yet comparable number Drosophila containing element. Although other as promoters, they present only small fraction represent not previously described suggesting distinct architecture animal genes.