Weighted sequence motifs as an improved seeding step in microRNA target prediction algorithms

Authors: Ola Sætrom, Ola Snøve, Pål Sætrom

DOI: 10.1261/RNA.7290705

Keywords: Set (abstract data type), Complementarity (molecular biology), Duplex (telecommunications), Genetics, Untranslated region, Sequence motif, Stability (learning theory), Algorithm, Sensitivity (control systems), Biology, Sequence

Abstract: We present a new microRNA target prediction algorithm called TargetBoost, and show that the algorithm is stable and identifies more true targets than do existing algorithms. TargetBoost uses machine learning on a set of validated microRNA targets in lower organisms to create weighted sequence motifs that capture the binding characteristics between microRNAs and their targets. Existing algorithms require candidates to have (1) near-perfect complementarity between the microRNAs' 5' end and their targets; (2) relatively high thermodynamic duplex stability; (3) multiple target sites in the target's 3' UTR; and (4) evolutionary conservation of the target between species. Most algorithms use one of the two first requirements in a seeding step, and use the three others as filters to improve the method's specificity. The initial seeding step determines an algorithm's sensitivity and also influences its specificity. As all algorithms may add filters to increase the specificity, we propose that methods should be compared before such filtering. We show that TargetBoost's weighted sequence motif approach is favorable to using both the duplex stability and the sequence complementarity steps. (TargetBoost is available as a Web tool from http://www.interagon.com/demo/.)
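To make the contrast between a complementarity seeding step and a weighted motif concrete, here is a minimal Python sketch. It is not TargetBoost's method, and the abstract does not specify how its motifs are learned or scored. The seed window (microRNA positions 2 to 8), the toy sequences, the per-position weights, and the function names `seed_sites` and `weighted_score` are all assumptions chosen for illustration. Only Watson-Crick pairs are counted; G:U wobble pairs are ignored.

```python
# Illustrative sketch only: hypothetical names, toy sequences, made-up weights.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA string (Watson-Crick only)."""
    return rna.translate(COMPLEMENT)[::-1]


def seed_sites(mirna: str, utr: str, start: int = 1, length: int = 7) -> list[int]:
    """Seeding by perfect complementarity.

    Returns 0-based positions in `utr` that match the reverse complement of
    the microRNA seed. The default window (positions 2-8, i.e. start=1,
    length=7) is an assumption, not taken from the abstract.
    """
    site = reverse_complement(mirna[start:start + length])
    return [i for i in range(len(utr) - length + 1) if utr[i:i + length] == site]


def weighted_score(mirna_segment: str, window: str, weights: list[float]) -> float:
    """Seeding by a weighted motif (hypothetical weights).

    microRNA position i pairs with the target base at the mirrored position
    of `window`, because the two strands are antiparallel. Each paired
    position contributes its weight, so a mismatch lowers the score
    instead of rejecting the site outright.
    """
    n = len(mirna_segment)
    score = 0.0
    for i, base in enumerate(mirna_segment):
        if base.translate(COMPLEMENT) == window[n - 1 - i]:
            score += weights[i]
    return score


if __name__ == "__main__":
    mirna = "UGAGGUAGUAGG"          # toy microRNA fragment
    utr = "AAACUACCUCAAA"           # toy 3' UTR with one perfect seed site
    weights = [2.0, 2.0, 1.5, 1.5, 1.0, 1.0, 0.5]  # made-up weights

    print(seed_sites(mirna, utr))
    print(weighted_score(mirna[1:8], "CUACCUC", weights))  # perfect site
    print(weighted_score(mirna[1:8], "CUACCUA", weights))  # one mismatch
```

In this sketch a single mismatch removes a site from the strict seed search entirely, while the weighted score only drops by that position's weight. This is one way to picture how a weighted seeding step could trade specificity for sensitivity before any filters are applied.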
