Targeted Methods for Biomarker Discovery, the Search for a Standard

Authors: Catherine Tuglus, Mark J van der Laan

DOI:

Keywords:

Abstract: More often than not, biomarker studies analyze large quantities of variables with complicated and generally unknown correlation structure. There are numerous statistical methods which attempt to unravel these variables and determine the underlying mechanism through identification of causally related biomarkers. Results from these methods are difficult to interpret and nearly impossible to compare across studies. The FDA has currently called for a standardization of protocol for biomarker detection. In response, we propose targeted variable importance (tVIM) as a standardized method for biomarker discovery. Through the use of targeted Maximum Likelihood, tVIM provides double robust estimates of variable importance along with formal inference. These measures are biologically interpretable as a causal effect under specified conditions, allowing for reproducibility across populations. In this analysis, four different measures of variable importance are provided by three methods: univariate linear regression (LM), LASSO penalized multiple linear regression (Q), and two randomForest measures (RF1 and RF2). Their performance is compared in simulation under conditions of increasing correlation. We are interested in their ability to distinguish the "true" relevant biomarkers from correlated decoy biomarkers. The comparisons are based on the resulting ranked list of each method, using p-values when available. In simulation, tVIM coupled with data-adaptive model selection outperforms linear regression and LASSO, and is more resilient to increases in correlation. In application, we apply all methods to the Golub et al. 1999 Leukemia data and compare the resulting gene lists for biological relevance. Both LM and tVIM are also applied to the van't Veer breast cancer data. [...] them [...] top 10 most important genes. From these results, tVIM appears to rank [...] genes at the top of its list [...] other methods. Given the extreme correlations, methods to reduce bias and provide realistic [...] are discussed.
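The ranking problem the abstract describes (a "true" biomarker versus a correlated decoy) can be illustrated with a small simulation. The sketch below is not tVIM and is not the authors' simulation design. It only shows, under assumed settings, how a univariate linear regression ranking behaves when a decoy is correlated with the true biomarker. The sample size, the effect size of 2.0, the number of noise variables, and the function names `simulate`, `abs_t_statistic`, and `ranked_list` are all hypothetical choices made for illustration.

```python
import math
import random


def simulate(n=500, n_noise=5, rho=0.8, seed=0):
    """Column 0 is the 'true' biomarker, column 1 a decoy correlated with it
    at level rho, and the remaining columns are independent noise.
    Only column 0 affects the outcome y (assumed effect size: 2.0)."""
    rng = random.Random(seed)
    columns = [[] for _ in range(2 + n_noise)]
    y = []
    for _ in range(n):
        true = rng.gauss(0, 1)
        decoy = rho * true + math.sqrt(1 - rho ** 2) * rng.gauss(0, 1)
        row = [true, decoy] + [rng.gauss(0, 1) for _ in range(n_noise)]
        for col, value in zip(columns, row):
            col.append(value)
        y.append(2.0 * true + rng.gauss(0, 1))
    return columns, y


def abs_t_statistic(x, y):
    """Absolute t-statistic of the slope when y is regressed on x alone.
    For a fixed sample size, a larger |t| means a smaller p-value."""
    n = len(y)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    r = sxy / math.sqrt(sxx * syy)
    return abs(r) * math.sqrt((n - 2) / (1 - r * r))


def ranked_list(columns, y):
    """Column indices sorted from most to least important by |t|."""
    scores = [abs_t_statistic(x, y) for x in columns]
    return sorted(range(len(columns)), key=lambda j: -scores[j])


if __name__ == "__main__":
    for rho in (0.0, 0.5, 0.9):
        columns, y = simulate(rho=rho)
        print(f"rho={rho}: ranking={ranked_list(columns, y)}")
```

In this setup the decoy has no effect on the outcome, yet a univariate ranking places it just below the true biomarker once the correlation is strong. That is the kind of behaviour the abstract's comparison under increasing correlation is meant to expose.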

References (47)
Mark J. van der Laan, Zhuo Yu. Measuring Treatment Effects Using Semiparametric Models (2003).
Oliver Bembom, Mark J. van der Laan, Jeffrey W. Fessel, Robert W. Shafer. Data-adaptive Selection Of The Adjustment Set In Variable Importance Estimation (2008).
Scott H. Kaufmann, Judith E. Karp, Phyllis A. Svingen, Stan Krajewski, Philip J. Burke, Steven D. Gore, John C. Reed. Elevated Expression of the Apoptotic Regulator Mcl-1 at the Time of Leukemic Relapse. Blood, vol. 91, pp. 991-1000 (1998). DOI: 10.1182/BLOOD.V91.3.991.991_991_1000
Sandra E. Sinisi, Mark J. van der Laan. Loss-Based Cross-Validated Deletion/Substitution/Addition Algorithms in Estimation. Research Papers in Economics (2004).
A Korman. CTLA-4 based therapy (MDX-010). Breast Cancer Research, vol. 5, p. 63 (2003). DOI: 10.1186/BCR731
Leo Breiman. Two-Eyed Algorithms and Problems. European Conference on Principles of Data Mining and Knowledge Discovery, p. 9 (2003). DOI: 10.1007/978-3-540-39804-2_2
James M. Robins, M. J. van der Laan. Unified Methods for Censored Longitudinal Data and Causality (2003).
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman. Classification and regression trees (1983).
Gabriela Alexe, Sorin Alexe, David E Axelrod, Tibérius O Bonates, Irina I Lozina, Michael Reiss, Peter L Hammer. Breast cancer prognosis by combinatorial analysis of gene expression data. Breast Cancer Research, vol. 8, pp. 1-20 (2006). DOI: 10.1186/BCR1512