Maximum Entropy Modeling of Short Sequence Motifs with Applications to RNA Splicing Signals

作者: Gene Yeo , Christopher B. Burge

DOI: 10.1089/1066527041410418

关键词:

摘要: We propose a framework for modeling sequence motifs based on the maximum entropy principle (MEP). recommend approximating short motif distributions with distribution (MED) consistent low-order marginal constraints estimated from available data, which may include dependencies between nonadjacent as well adjacent positions. Many models (MEMs) are specified by simply changing set of constraints. Such can be utilized to discriminate signals and decoys. Classification performance using different MEMs gives insight into relative importance apply our large datasets RNA splicing signals. Our best out-perform previous probabilistic in discrimination human 5' (donor) 3' (acceptor) splice sites Finally, we discuss mechanistically motivated ways comparing models.

参考文章(23)
L. Varani, M. Hasegawa, M. G. Spillantini, M. J. Smith, J. R. Murrell, B. Ghetti, A. Klug, M. Goedert, G. Varani, Structure of tau exon 10 splicing regulatory element RNA and destabilization by mutations of frontotemporal dementia and parkinsonism linked to chromosome 17 Proceedings of the National Academy of Sciences of the United States of America. ,vol. 96, pp. 8229- 8234 ,(1999) , 10.1073/PNAS.96.14.8229
Laura Eng, Gabriela Coutinho, Shareef Nahas, Gene Yeo, Robert Tanouye, Mahnoush Babaei, Thilo Dörk, Christopher Burge, Richard A. Gatti, Nonclassical splicing mutations in the coding and noncoding regions of the ATM Gene: maximum entropy estimates of splice junction strengths. Human Mutation. ,vol. 23, pp. 67- 76 ,(2004) , 10.1002/HUMU.10295
J. Swets, Measuring the accuracy of diagnostic systems Science. ,vol. 240, pp. 1285- 1293 ,(1988) , 10.1126/SCIENCE.3287615
W. G. Fairbrother, Predictive Identification of Exonic Splicing Enhancers in Human Genes Science. ,vol. 297, pp. 1007- 1013 ,(2002) , 10.1126/SCIENCE.1073774
E. T. Jaynes, Information Theory and Statistical Mechanics Physical Review. ,vol. 106, pp. 620- 630 ,(1957) , 10.1103/PHYSREV.106.620
J. Shore, R. Johnson, Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy IEEE Transactions on Information Theory. ,vol. 26, pp. 26- 37 ,(1980) , 10.1109/TIT.1980.1056144
David T. Brown, A note on approximations to discrete probability distributions Information & Computation. ,vol. 2, pp. 386- 392 ,(1959) , 10.1016/S0019-9958(59)80016-4
C. T. IRELAND, S. KULLBACK, Contingency tables with given marginals Biometrika. ,vol. 55, pp. 179- 188 ,(1968) , 10.1093/BIOMET/55.1.179
P.M. Lewis, Approximating probability distributions to reduce storage requirements Information & Computation. ,vol. 2, pp. 214- 225 ,(1959) , 10.1016/S0019-9958(59)90207-4