Predicting CTCF-mediated chromatin interactions by integrating genomic and epigenomic features

Nat Commun. 2018 Oct 11;9(1):4221. doi: 10.1038/s41467-018-06664-6.

Abstract

The CCCTC-binding zinc-finger protein (CTCF)-mediated network of long-range chromatin interactions is important for genome organization and function. Although this network has been considered largely invariant, we find that it exhibits extensive cell-type-specific interactions that contribute to cell identity. Here, we present Lollipop, a machine-learning framework, which predicts CTCF-mediated long-range interactions using genomic and epigenomic features. Using ChIA-PET data as benchmark, we demonstrate that Lollipop accurately predicts CTCF-mediated chromatin interactions both within and across cell types, and outperforms other methods based only on CTCF motif orientation. Predictions are confirmed computationally and experimentally by Chromatin Conformation Capture (3C). Moreover, our approach identifies other determinants of CTCF-mediated chromatin wiring, such as gene expression within the loops. Our study contributes to a better understanding about the underlying principles of CTCF-mediated chromatin interactions and their impact on gene expression.

Publication types

  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • CCCTC-Binding Factor / metabolism*
  • Cell Line
  • Chromatin / metabolism*
  • Epigenesis, Genetic*
  • Gene Regulatory Networks
  • Genome*
  • Humans
  • ROC Curve

Substances

  • CCCTC-Binding Factor
  • Chromatin