COE: a general approach for efficient genome-wide two-locus epistasis test in disease association study.

作者: Xiang Zhang , Feng Pan , Yuying Xie , Fei Zou , Wei Wang

DOI: 10.1089/CMB.2009.0155

关键词:

摘要: The availability of high-density single nucleotide polymorphisms (SNPs) data has made genome-wide association study computationally challenging. Two-locus epistasis (gene-gene interaction) detection attracted great research interest as a promising method for genetic analysis complex diseases. In this article, we propose general approach, COE, efficient large scale gene-gene interaction analysis, which supports wide range tests. particular, show that many commonly used statistics are convex functions. From the observed values events in two-locus test, can develop an upper bound test value. Such only depends on single-locus and genotype SNP-pair. We thus group index SNP-pairs by their genotypes. This indexing structure benefit computation all statistics. Utilizing structure, prune most without compromising optimality result. Our approach is especially permutation test. Extensive experiments demonstrate our provides orders magnitude performance improvement over brute force approach.

参考文章(16)
Charles Erlichman, Daniel J. Sargent, New Treatment Options for Colorectal Cancer The New England Journal of Medicine. ,vol. 351, pp. 391- 392 ,(2004) , 10.1056/NEJME048151
Claire M Wade, Mark J Daly, Genetic variation in laboratory mice Nature Genetics. ,vol. 37, pp. 1175- 1180 ,(2005) , 10.1038/NG1666
Stefan Böhringer, Cornelia Hardt, Bianca Miterski, Ansgar Steland, Jörg T Epplen, Multilocus statistics to uncover epistasis and heterogeneity in complex diseases: revisiting a set of multiple sclerosis data European Journal of Human Genetics. ,vol. 11, pp. 573- 584 ,(2003) , 10.1038/SJ.EJHG.5201008
Michael N Weedon, Guillaume Lettre, Rachel M Freathy, Cecilia M Lindgren, Benjamin F Voight, John RB Perry, Katherine S Elliott, Rachel Hackett, Candace Guiducci, Beverley Shields, Eleftheria Zeggini, Hana Lango, Valeriya Lyssenko, Nicholas J Timpson, Noel P Burtt, Nigel W Rayner, Richa Saxena, Kristin Ardlie, Jonathan H Tobias, Andrew R Ness, Susan M Ring, Colin NA Palmer, Andrew D Morris, Leena Peltonen, Veikko Salomaa, Diabetes Genetics Initiative, Wellcome Trust Case Control Consortium, George Davey Smith, Leif C Groop, Andrew T Hattersley, Mark I McCarthy, Joel N Hirschhorn, Timothy M Frayling, None, A common variant of HMGA2 is associated with adult and childhood height in the general population Nature Genetics. ,vol. 39, pp. 1245- 1250 ,(2007) , 10.1038/NG2121
Allen D. Roses, The genome era begins... Nature Genetics. ,vol. 33, pp. 217- 217 ,(2003) , 10.1038/NG1110
Kouichi Ozaki, Yozo Ohnishi, Aritoshi Iida, Akihiko Sekine, Ryo Yamada, Tatsuhiko Tsunoda, Hiroshi Sato, Hideyuki Sato, Masatsugu Hori, Yusuke Nakamura, Toshihiro Tanaka, Functional SNPs in the lymphotoxin-alpha gene that are associated with susceptibility to myocardial infarction. Nature Genetics. ,vol. 32, pp. 650- 654 ,(2002) , 10.1038/NG1047
Daniel Segrè, Alexander DeLuna, George M Church, Roy Kishony, Modular epistasis in yeast metabolism. Nature Genetics. ,vol. 37, pp. 77- 83 ,(2005) , 10.1038/NG1489
Jinying Zhao, Eric Boerwinkle, Momiao Xiong, An Entropy-Based Statistic for Genomewide Association Studies American Journal of Human Genetics. ,vol. 77, pp. 27- 40 ,(2005) , 10.1086/431243
Josephine Hoh, Jurg Ott, Mathematical multi-locus approaches to localizing complex human trait genes Nature Reviews Genetics. ,vol. 4, pp. 701- 709 ,(2003) , 10.1038/NRG1155
Rebecca W. Doerge, Mapping and analysis of quantitative trait loci in experimental populations Nature Reviews Genetics. ,vol. 3, pp. 43- 52 ,(2002) , 10.1038/NRG703