Application of machine learning algorithms to predict coronary artery calcification with a sibship-based design

作者: Yan V. Sun , Lawrence F. Bielak , Patricia A. Peyser , Stephen T. Turner , Patrick F. Sheedy

DOI: 10.1002/GEPI.20309

关键词: Artificial intelligenceGenetic epidemiologyPercentileSubclinical infectionBody mass indexLipoprotein particleRandom forestMachine learningAlgorithmSingle-nucleotide polymorphismHomocysteineMedicineGenetics(clinical)Epidemiology

摘要: As part of the Genetic Epidemiology Network Arteriopathy study, hypertensive non-Hispanic White sibships were screened using 471 single nucleotide polymorphisms (SNPs) to identify genes influencing coronary artery calcification (CAC) measured by computed tomography. Individuals with detectable CAC and quantity Z70th age- sexspecific percentile classified as having a high burden compared individuals o70th percentile. Two sibs from each sibship randomly chosen divided into two data sets, 360 unrelated individuals. Within set, we applied machine learning algorithms, Random Forests RuleFit, best predictors among 17 risk factors SNPs. Using five-fold cross-validation, both methods had 70% sensitivity 60% specificity. Prediction accuracies significantly different random predictions (P-valueo0.001) based on 1,000 permutation tests. Predictability 287 tagSNPs was good all For Forests, top 50 predictors, same eight 15 found in sets while 12 for RuleFit. Replicable effects (in GPR35 NOS3) (age, body mass index, sex, serum glucose, high-density lipoprotein cholesterol, systolic blood pressure, homocysteine, triglycerides, fibrinogen, Lp(a) low-density particle size) identified methods. This study illustrates how can be used important, replicable subclinical atherosclerosis. Genet. Epidemiol. 32:350–360, 2008. r 2008 Wiley-Liss, Inc.

参考文章(46)
Matthew J. Budoff, Leslee J. Shaw, Sandy T. Liu, Steven R. Weinstein, Philip H. Tseng, Ferdinand R. Flores, Tracy Q. Callister, Paolo Raggi, Daniel S. Berman, Tristen P. Mosler, Long-Term Prognosis Associated With Coronary Calcification: Observations From a Registry of 25,253 Patients Journal of the American College of Cardiology. ,vol. 49, pp. 1860- 1870 ,(2007) , 10.1016/J.JACC.2006.10.079
Kevin L. Gunderson, Frank J. Steemers, Hongi Ren, Pauline Ng, Lixin Zhou, Chan Tsan, Weihua Chang, Dave Bullis, Joe Musmacker, Christine King, Lori L. Lebruska, David Barker, Arnold Oliphant, Kenneth M. Kuhn, Richard Shen, Whole-genome Genotyping Methods in Enzymology. ,vol. 410, pp. 359- 376 ,(2006) , 10.1016/S0076-6879(06)10017-8
Carolin Strobl, Anne-Laure Boulesteix, Achim Zeileis, Torsten Hothorn, Bias in random forest variable importance measures: Illustrations, sources and a solution BMC Bioinformatics. ,vol. 8, pp. 25- 25 ,(2007) , 10.1186/1471-2105-8-25
Daniel M Hoefner, Shannon D Hodel, John F O’Brien, Earl L Branum, Deborah Sun, Irene Meissner, Joseph P McConnell, Development of a Rapid, Quantitative Method for LDL Subfractionation with Use of the Quantimetrix Lipoprint LDL System Clinical Chemistry. ,vol. 47, pp. 266- 274 ,(2001) , 10.1093/CLINCHEM/47.2.266
Mark J Magera, Jean M Lacey, Bruno Casetta, Piero Rinaldo, Method for the Determination of Total Homocysteine in Plasma and Urine by Stable Isotope Dilution and Electrospray Tandem Mass Spectrometry Clinical Chemistry. ,vol. 45, pp. 1517- 1522 ,(1999) , 10.1093/CLINCHEM/45.9.1517
Lewis Wexler, Bruce Brundage, John Crouse, Robert Detrano, Valentin Fuster, Jamshid Maddahi, John Rumberger, William Stanford, Richard White, Kathryn Taubert, Coronary artery calcification: pathophysiology, epidemiology, imaging methods, and clinical implications. A statement for health professionals from the American Heart Association. Writing Group. Circulation. ,vol. 94, pp. 1175- 1192 ,(1996) , 10.1161/01.CIR.94.5.1175
A. Clauss, Rapid physiological coagulation method in determination of fibrinogen Acta Haematologica. ,vol. 17, pp. 237- 246 ,(1957) , 10.1159/000205234
Jinghong Wang, Nicole Simonavicius, Xiaosu Wu, Gayathri Swaminath, Jeff Reagan, Hui Tian, Lei Ling, Kynurenic Acid as a Ligand for Orphan G Protein-coupled Receptor GPR35 Journal of Biological Chemistry. ,vol. 281, pp. 22021- 22028 ,(2006) , 10.1074/JBC.M603503200
Daniel M. Levine, Betty-Jane Sloan, Joan E. Donner, Jeffrey D. Lorenz, Rollin H. Heinzerling, Automated measurement of lipoprotein(a) by immunoturbidimetric analysis International Journal of Clinical & Laboratory Research. ,vol. 22, pp. 173- 178 ,(1992) , 10.1007/BF02591419