Estimating the null distribution for conditional inference and genome-scale screening

作者: David R. Bickel

DOI: 10.1111/J.1541-0420.2010.01491.X

关键词:

摘要: In a novel approach to the multiple testing problem, Efron (2004; 2007) formulated estimators of distribution test statistics or nominal p-values under null suitable for modeling data thousands unaffected genes, non-associated single-nucleotide polymorphisms, other biological features. Estimators can improve not only empirical Bayes procedure which it was originally intended, but also many comparison procedures. Such serve as groundwork proposed based on recent frequentist method minimizing posterior expected loss, exemplified with non-additive loss function designed genomic screening rather than validation. The merit estimating is examined from vantage point conditional inference in remainder paper. simulation study genome-scale testing, conditioning observed confidence level estimated an approximate ancillary statistic markedly improved inference. To enable researchers determine whether rely particular decision making, information-theoretic score provided that quantifies benefit conditioning. As sum degree ancillarity and inferential relevance, reflects balance would strike between two conflicting terms. Applications gene expression microarray illustrate methods introduced.

参考文章(41)
Deborah G. Mayo, D. R. Cox, Frequentist statistics as a theory of inductive inference arXiv: Statistics Theory. pp. 77- 97 ,(2006) , 10.1214/074921706000000400
Charles Stein, Inadmissibility of the Usual Estimator for the Mean of a Multivariate Normal Distribution Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics. pp. 197- 206 ,(1956)
Sandrine Dudoit, Juliet Popper Shaffer, Jennifer C. Block, Multiple Hypothesis Testing in Microarray Experiments Statistical Science. ,vol. 18, pp. 71- 103 ,(2003) , 10.1214/SS/1056397487
D. A. S. Fraser, Ancillaries and Conditional Inference Statistical Science. ,vol. 19, pp. 333- 369 ,(2004) , 10.1214/088342304000000323
Bradley Efron, Robert Tibshirani, The problem of regions Annals of Statistics. ,vol. 26, pp. 1687- 1718 ,(1998) , 10.1214/AOS/1024691353
Kun-Liang Lu, Discussion: An Ancillarity Paradox which Appears in Multiple Linear Regression Annals of Statistics. ,vol. 18, pp. 525- 528 ,(1990) , 10.1214/AOS/1176347611
David R. Bickel, Error-Rate and Decision-Theoretic Methods of Multiple Testing: Which Genes Have High Objective Probabilities of Differential Expression? Statistical Applications in Genetics and Molecular Biology. ,vol. 3, pp. 1- 20 ,(2004) , 10.2202/1544-6115.1043