Identifying candidate causal variants via trans-population fine-mapping

Genet Epidemiol. 2010 Nov;34(7):653-64. doi: 10.1002/gepi.20522.

Abstract

Genome-wide association studies have discovered and confirmed a large number of loci that are implicated with disease susceptibility and severity. Polymorphisms that emerged from these studies are mostly indirectly associated to the phenotype, and the natural progression is to identify the causal variants that are functionally responsible for these association signals. Long stretches of high linkage disequilibrium (LD) benefitted the initial discovery phase in a genome-wide scan, allowing commercial genotyping products with imperfect coverage to detect genomic regions genuinely associated with the phenotype. However, regions of high LD confound the fine-mapping phase, as markers that are perfectly correlated to the causal variants display similar evidence of phenotypic association, hampering the process of differentiating the functional polymorphisms from neighboring surrogates. Here, we explore the potential of integrating information across different populations for narrowing the candidate region that a causal variant resides in, and compare the efficacy of this process of trans-population fine-mapping with the extent of variation in patterns of LD between the populations. In addition, we explore two different strategies for pooling data across multiple populations for the purpose of prioritizing the rankings of the causal variants. Our results clearly establish the benefits of trans-population analysis in reducing the number of possible candidates for the causal variants, particularly in genomic regions displaying strong evidence of inter-population LD variation. Directly integrating the statistical evidence by summing the test statistics outperforms the standard meta-analytic procedure. These findings have direct relevance to the design and analysis of ongoing fine-mapping studies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Case-Control Studies
  • Chromosome Mapping / methods*
  • Computer Simulation
  • Cyclin-Dependent Kinase 5 / genetics
  • Gene Frequency
  • Genetic Predisposition to Disease
  • Genetic Variation*
  • Genome-Wide Association Study
  • Humans
  • Linkage Disequilibrium*
  • Models, Genetic
  • Molecular Epidemiology
  • Polymorphism, Single Nucleotide
  • tRNA Methyltransferases

Substances

  • tRNA Methyltransferases
  • Cyclin-Dependent Kinase 5
  • CDKAL1 protein, human