Detecting genetic association through shortest paths in a bidirected graph

  • Masao Ueki
    Biostatistics Center Kurume University Fukuoka Japan
  • Yoshinori Kawasaki
    Department of Statistical Modeling The Institute of Statistical Mathematics, The Graduate University for Advanced Studies Tachikawa Tokyo Japan
  • Gen Tamiya
    Statistical Genetics and Genomics, Tohoku Medical Megabank Organization Tohoku University Aoba‐Ku Sendai Japan

説明

<jats:title>ABSTRACT</jats:title><jats:p>Genome‐wide association studies (GWASs) commonly use marginal association tests for each single‐nucleotide polymorphism (SNP). Because these tests treat SNPs as independent, their power will be suboptimal for detecting SNPs hidden by linkage disequilibrium (LD). One way to improve power is to use a multiple regression model. However, the large number of SNPs preclude simultaneous fitting with multiple regression, and subset regression is infeasible because of an exorbitant number of candidate subsets. We therefore propose a new method for detecting hidden SNPs having significant yet weak marginal association in a multiple regression model. Our method begins by constructing a bidirected graph locally around each SNP that demonstrates a moderately sized marginal association signal, the<jats:italic>focal</jats:italic>SNPs. Vertexes correspond to SNPs, and adjacency between vertexes is defined by an LD measure. Subsequently, the method collects from each graph all shortest paths to the focal SNP. Finally, for each shortest path the method fits a multiple regression model to all the SNPs lying in the path and tests the significance of the regression coefficient corresponding to the terminal SNP in the path. Simulation studies show that the proposed method can detect susceptibility SNPs hidden by LD that go undetected with marginal association testing or with existing multivariate methods. When applied to real GWAS data from the Alzheimer's Disease Neuroimaging Initiative (ADNI), our method detected two groups of SNPs: one in a region containing the<jats:italic>apolipoprotein E</jats:italic>(<jats:italic>APOE</jats:italic>) gene, and another in a region close to the<jats:italic>semaphorin 5A</jats:italic>(<jats:italic>SEMA5A</jats:italic>) gene.</jats:p>

収録刊行物

被引用文献 (1)*注記

もっと見る

参考文献 (30)*注記

もっと見る

関連プロジェクト

もっと見る

問題の指摘

ページトップへ