Regression for skewed biomarker outcomes subject to pooling.

作者: Emily M. Mitchell , Robert H. Lyles , Amita K. Manatunga , Michelle Danaher , Neil J. Perkins

DOI: 10.1111/BIOM.12134

关键词: Linear modelPoolingStatisticsCovariateRegressionRegression analysisStandard errorLogistic regressionEfficiencyEconometricsMathematics

摘要: Epidemiological studies involving biomarkers are often hindered by prohibitively expensive laboratory tests. Strategically pooling specimens prior to performing these lab assays has been shown effectively reduce cost with minimal information loss in a logistic regression setting. When the goal is perform continuous biomarker as outcome, analysis of pooled may not be straightforward, particularly if outcome right-skewed. In such cases, we demonstrate that slight modification standard multiple linear model for poolwise data can provide valid and precise coefficient estimates when pools formed combining biospecimens from subjects identical covariate values. x-homogeneous cannot formed, propose Monte Carlo expectation maximization (MCEM) algorithm compute maximum likelihood (MLEs). Simulation analytical methods essentially unbiased parameters well their errors appropriate assumptions met. Furthermore, show how one utilize fully observed inform strategy, yielding high level statistical efficiency at fraction total cost.

参考文章(30)
Roderick Little, Donald Rubin, Statistical Analysis with Missing Data, Second Edition Wiley Series in Probability and Statistics. ,(2019) , 10.1002/9781119482260
J.C.S. Santos Filho, M.D. Yacoub, P. Cardieri, Highly accurate range-adaptive lognormal approximation to lognormal sum distributions Electronics Letters. ,vol. 42, pp. 361- 363 ,(2006) , 10.1049/EL:20064091
Chang-Xing Ma, Albert Vexler, Enrique F. Schisterman, Lili Tian, Cost-efficient designs based on linearly associated biomarkers Journal of Applied Statistics. ,vol. 38, pp. 2739- 2750 ,(2011) , 10.1080/02664763.2011.567254
Robert Dorfman, The Detection of Defective Members of Large Populations Annals of Mathematical Statistics. ,vol. 14, pp. 436- 440 ,(1943) , 10.1214/AOMS/1177731363
Shou-Jen Lan, Chung-Cheng Hsieh, Yea-Yin Yen, Pooling Strategies for Screening Blood in Areas with Low Prevalence of HIV Biometrical Journal. ,vol. 35, pp. 553- 565 ,(1993) , 10.1002/BIMJ.4710350505
Pamela M. Marcus, Polly A. Newcomb, Terry Young, Barry E. Storer, The association of reproductive and menstrual characteristics and colon and rectal cancer risk in Wisconsin women Annals of Epidemiology. ,vol. 5, pp. 303- 309 ,(1995) , 10.1016/1047-2797(94)00097-D
Thomas A. Louis, Finding the Observed Information Matrix When Using the EM Algorithm Journal of the royal statistical society series b-methodological. ,vol. 44, pp. 226- 233 ,(1982) , 10.1111/J.2517-6161.1982.TB01203.X
Enrique F Schisterman, Albert Vexler, None, To pool or not to pool, from whether to when: applications of pooling to biospecimens subject to a limit of detection. Paediatric and Perinatal Epidemiology. ,vol. 22, pp. 486- 496 ,(2008) , 10.1111/J.1365-3016.2008.00956.X
Brian W. Whitcomb, Neil J. Perkins, Zhiwei Zhang, Aijun Ye, Robert H. Lyles, Assessment of skewed exposure in case‐control studies with pooling Statistics in Medicine. ,vol. 31, pp. 2461- 2472 ,(2012) , 10.1002/SIM.5351