The Bayesian Elastic Net: Classifying Multi-Task Gene-Expression Data

作者: Aimee Zaas , Lawrence Carin , Geoffrey S. Ginsburg , Minhua Chen , Alfred Hero

DOI:

关键词: InferenceLaplace distributionBayesian averageMachine learningData setComputer scienceElastic net regularizationAlgorithmBayesian probabilityFeature selectionArtificial intelligencePrior probability

摘要: Highly correlated relevant features are frequently encountered in variable-selection problems, with gene-expression analysis an important example. It is desirable to select all of these highly simultaneously as a group, for better model interpretation and robustness. Further, irrelevant should be excluded, resulting sparse solution (of importance avoiding over-fitting limited data). We address the problem grouped variable selection by introducing new Bayesian Elastic Net model. One advantage proposed that imposing priors on individual parameters Laplace distribution, we reduce number tuning one, compared two such original Net. In addition, extend probit regression, order deal classification problems but set covariates (features). Extension multi-task learning also considered, inference performed using variational analysis. The validated first performing experiments simulated data previously published data; perform comparisons Lasso. Finally, present analyze time-evolving properties influenza, measured blood samples from human subjects recent challenge study.

参考文章(25)
Geoffrey J. McLachlan, Christophe Ambroise, Kim-Anh Do, Analyzing Microarray Gene Expression Data ,(2004)
Javed Khan, Jun S Wei, Markus Ringner, Lao H Saal, Marc Ladanyi, Frank Westermann, Frank Berthold, Manfred Schwab, Cristina R Antonescu, Carsten Peterson, Paul S Meltzer, None, Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks Nature Medicine. ,vol. 7, pp. 673- 679 ,(2001) , 10.1038/89044
Jim E. Griffin, Philip J. Brown, Bayesian adaptive lassos with non-convex penalization University of Warwick. Centre for Research in Statistical Methodology. ,(2007)
Minjung Kyung, Jeff Gill, Malay Ghosh, George Casella, Penalized regression, standard errors, and Bayesian lassos Bayesian Analysis. ,vol. 5, pp. 369- 411 ,(2010) , 10.1214/10-BA607
Trevor Park, George Casella, The Bayesian Lasso Journal of the American Statistical Association. ,vol. 103, pp. 681- 686 ,(2008) , 10.1198/016214508000000337
Scott Shaobing Chen, David L. Donoho, Michael A. Saunders, Atomic Decomposition by Basis Pursuit SIAM Journal on Scientific Computing. ,vol. 20, pp. 33- 61 ,(1998) , 10.1137/S1064827596304010
Qing Li, Nan Lin, The Bayesian elastic net Bayesian Analysis. ,vol. 5, pp. 151- 170 ,(2010) , 10.1214/10-BA506
Zhenqiu Liu, Feng Jiang, Guoliang Tian, Suna Wang, Fumiaki Sato, Stephen J. Meltzer, Ming Tan, Sparse Logistic Regression with Lp Penalty for Biomarker Identification Statistical Applications in Genetics and Molecular Biology. ,vol. 6, pp. 6- ,(2007) , 10.2202/1544-6115.1248
Hui Zou, The adaptive lasso and its oracle properties Journal of the American Statistical Association. ,vol. 101, pp. 1418- 1429 ,(2006) , 10.1198/016214506000000735
Carlos M. Carvalho, Jeffrey Chang, Joseph E. Lucas, Joseph R. Nevins, Quanli Wang, Mike West, High-dimensional sparse factor modeling: Applications in gene expression genomics Journal of the American Statistical Association. ,vol. 103, pp. 1438- 1456 ,(2008) , 10.1198/016214508000000869