A Faster Reliable Algorithm to Estimate the p-Value of the Multinomial llr Statistic

Authors: Uri Keich, Niranjan Nagarajan

DOI: 10.1007/978-3-540-30219-3_10

Abstract: The subject of estimating the p-value of the log-likelihood ratio statistic for the multinomial distribution has been studied extensively in the statistical literature. Nevertheless, bioinformatics has posed new challenges for that research by often concentrating its interest on the “thin tail” of the distribution, where classical approximation typically fails. Hence, some of the more recent developments in this area have come from the bioinformatics community ([5], [3]).

References (16)
Noel Cressie, Timothy R. C. Read. Pearson's X^2 and the Loglikelihood Ratio Statistic G^2: A Comparative Review. International Statistical Review / Revue Internationale de Statistique, vol. 57, pp. 19-43 (1989). DOI: 10.2307/1403582
Sven Rahmann. Dynamic programming algorithms for two statistical problems in computational biology. Workshop on Algorithms in Bioinformatics, vol. 2812, pp. 151-164 (2003). DOI: 10.1007/978-3-540-39763-2_12
Charles Elkan, Timothy L. Bailey. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Intelligent Systems in Molecular Biology, vol. 2, pp. 28-36 (1994).
Wassily Hoeffding. Asymptotically Optimal Tests for Multinomial Distributions. Annals of Mathematical Statistics, vol. 36, pp. 431-471 (1965). DOI: 10.1007/978-1-4612-0865-5_28
G. Z. Hertz, G. D. Stormo. Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics, vol. 15, pp. 563-577 (1999). DOI: 10.1093/BIOINFORMATICS/15.7.563
Karim F. Hirji. A comparison of algorithms for exact goodness-of-fit tests for multinomial data. Communications in Statistics - Simulation and Computation, vol. 26, pp. 1197-1227 (1997). DOI: 10.1080/03610919708813435
Wilbert C. M. Kallenberg. On Moderate and Large Deviations in Multinomial Distributions. Annals of Statistics, vol. 13, pp. 1554-1580 (1985). DOI: 10.1214/AOS/1176349755
Uri Keich. sFFT: a faster accurate computation of the p-value of the entropy score. Journal of Computational Biology, vol. 12, pp. 416-430 (2005). DOI: 10.1089/CMB.2005.12.416
Jenny Baglivo, Donald Olivier, Marcello Pagano. Methods for Exact Goodness-of-Fit Tests. Journal of the American Statistical Association, vol. 87, pp. 464-469 (1992). DOI: 10.1080/01621459.1992.10475227
Gill Bejerano. Efficient exact p-value computation and applications to biosequence analysis. Proceedings of the Seventh Annual International Conference on Computational Molecular Biology - RECOMB '03, pp. 38-47 (2003). DOI: 10.1145/640075.640080