Whole genome sequencing and imputation in isolated populations identify genetic associations with medically-relevant complex traits

Nat Commun. 2017 May 26:8:15606. doi: 10.1038/ncomms15606.

Abstract

Next-generation association studies can be empowered by sequence-based imputation and by studying founder populations. Here we report ∼9.5 million variants from whole-genome sequencing (WGS) of a Cretan-isolated population, and show enrichment of rare and low-frequency variants with predicted functional consequences. We use a WGS-based imputation approach utilizing 10,422 reference haplotypes to perform genome-wide association analyses and observe 17 genome-wide significant, independent signals, including replicating evidence for association at eight novel low-frequency variant signals. Two novel cardiometabolic associations are at lead variants unique to the founder population sequences: chr16:70790626 (high-density lipoprotein levels beta -1.71 (SE 0.25), P=1.57 × 10-11, effect allele frequency (EAF) 0.006); and rs145556679 (triglycerides levels beta -1.13 (SE 0.17), P=2.53 × 10-11, EAF 0.013). Our findings add empirical support to the contribution of low-frequency variants in complex traits, demonstrate the advantage of including population-specific sequences in imputation panels and exemplify the power gains afforded by population isolates.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Female
  • Gene Frequency / genetics*
  • Genetic Variation
  • Genome, Human / genetics*
  • Genome-Wide Association Study / methods*
  • Greece
  • Haplotypes
  • Humans
  • Male
  • Middle Aged
  • Phenotype
  • Polymorphism, Single Nucleotide
  • White People / genetics*
  • Whole Genome Sequencing / methods