Genome-wide identification of genes regulating DNA methylation using genetic anchors for causal inference

Genome Biol. 2020 Aug 28;21(1):220. doi: 10.1186/s13059-020-02114-z.

Abstract

Background: DNA methylation is a key epigenetic modification in human development and disease, yet there is limited understanding of its highly coordinated regulation. Here, we identify 818 genes that affect DNA methylation patterns in blood using large-scale population genomics data.

Results: By employing genetic instruments as causal anchors, we establish directed associations between gene expression and distant DNA methylation levels, while ensuring specificity of the associations by correcting for linkage disequilibrium and pleiotropy among neighboring genes. The identified genes are enriched for transcription factors, of which many consistently increased or decreased DNA methylation levels at multiple CpG sites. In addition, we show that a substantial number of transcription factors affected DNA methylation at their experimentally determined binding sites. We also observe genes encoding proteins with heterogenous functions that have widespread effects on DNA methylation, e.g., NFKBIE, CDCA7(L), and NLRC5, and for several examples, we suggest plausible mechanisms underlying their effect on DNA methylation.

Conclusion: We report hundreds of genes that affect DNA methylation and provide key insights in the principles underlying epigenetic regulation.

Keywords: Causal inference; Chromatin; DNA methylation; Epigenetic regulation; Functional genomics; Genetic instrumental variable; Pleiotropy; Transcription factor.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Methylation*
  • Endopeptidases / genetics
  • Epigenesis, Genetic*
  • Gene Expression
  • Genetic Pleiotropy
  • Genome-Wide Association Study*
  • Genomics
  • Humans
  • I-kappa B Proteins / genetics
  • Intracellular Signaling Peptides and Proteins / genetics
  • Nuclear Proteins / genetics
  • Proto-Oncogene Proteins / genetics
  • Repressor Proteins / genetics
  • Transcription Factors / genetics

Substances

  • CDCA7 protein, human
  • CDCA7L protein, human
  • I-kappa B Proteins
  • Intracellular Signaling Peptides and Proteins
  • NFKBIE protein, human
  • NLRC5 protein, human
  • Nuclear Proteins
  • Proto-Oncogene Proteins
  • Repressor Proteins
  • Transcription Factors
  • Endopeptidases
  • SENP7 protein, human