The Function-on-Scalar LASSO with Applications to Longitudinal GWAS

作者: Matthew Reimherr , Rina Foygel Barber , Thomas Schill

DOI:

关键词: Predictor variablesFeature selectionScalar (mathematics)Framingham Heart StudyGenome-wide association studyRegressionFunctional methodsEstimation theoryMathematicsApplied mathematics

摘要: We present a new methodology for simultaneous variable selection and parameter estimation in function-on-scalar regression with an ultra-high dimensional predictor vector. extend the LASSO to functional data both $\textit{dense}$ setting $\textit{sparse}$ setting. provide theoretical guarantees which allow exponential number of variables. Simulations are carried out illustrate compare sparse/functional methods. Using Framingham Heart Study, we demonstrate how our tools can be used genome-wide association studies, finding genetic mutations affect blood pressure therefore important cardiovascular health.

参考文章(33)
Piotr Kokoszka, Lajos Horváth, Inference for Functional Data with Applications ,(2012)
Pascal Sarda, F reric Ferraty, SPLINE ESTIMATORS FOR THE FUNCTIONAL LINEAR MODEL ,(2003)
Peixin Zhao, Liugen Xue, Variable selection for semiparametric varying coefficient partially linear errors-in-variables models Journal of Multivariate Analysis. ,vol. 101, pp. 1872- 1883 ,(2010) , 10.1016/J.JMVA.2010.03.005
Karim Lounici, Massimiliano Pontil, Sara van de Geer, Alexandre B. Tsybakov, Oracle Inequalities and Optimal Inference under Group Sparsity Annals of Statistics. ,vol. 39, pp. 2164- 2204 ,(2011) , 10.1214/11-AOS896
Martin G Larson, Larry D Atwood, Emelia J Benjamin, L Adrienne Cupples, Ralph B D'Agostino, Caroline S Fox, Diddahally R Govindaraju, Chao-Yu Guo, Nancy L Heard-Costa, Shih-Jen Hwang, Joanne M Murabito, Christopher Newton-Cheh, Christopher J O'Donnell, Sudha Seshadri, Ramachandran S Vasan, Thomas J Wang, Philip A Wolf, Daniel Levy, Framingham Heart Study 100K project: genome-wide associations for cardiovascular disease outcomes BMC Medical Genetics. ,vol. 8, pp. 1- 9 ,(2007) , 10.1186/1471-2350-8-S1-S5
Syed S Mahmood, Daniel Levy, Ramachandran S Vasan, Thomas J Wang, The Framingham Heart Study and the epidemiology of cardiovascular disease: a historical perspective The Lancet. ,vol. 383, pp. 999- 1008 ,(2014) , 10.1016/S0140-6736(13)61752-3
Richard Baraniuk, Mark Davenport, Ronald DeVore, Michael Wakin, A Simple Proof of the Restricted Isometry Property for Random Matrices Constructive Approximation. ,vol. 28, pp. 253- 263 ,(2008) , 10.1007/S00365-007-9003-X
Donald R Hoover, John A Rice, Colin O Wu, Li-Ping Yang, Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data Biometrika. ,vol. 85, pp. 809- 822 ,(1998) , 10.1093/BIOMET/85.4.809