Clustering-local-unique-enriched-signals (CLUES) promotes identification of novel regulators of ES cell self-renewal and pluripotency

PLoS One. 2018 Nov 6;13(11):e0206844. doi: 10.1371/journal.pone.0206844. eCollection 2018.

Abstract

Background: Key regulators of developmental processes can be prioritized through integrated analysis of ChIP-Seq data for master transcription factors (TFs) such as Nanog and Oct4, active histone modifications (HMs) such as H3K4me3 and H3K27ac, and repressive HMs such as H3K27me3. Recent studies show that broad enrichment signals, such as super-enhancers and broad H3K4me3 enrichment signals, play more dominant roles in epigenetic regulatory mechanisms than the short enrichment signals of the master TFs and H3K4me3. Besides the broad enrichment signals, up to tens of thousands of short enrichment signals of these TFs and HMs exist in the genome. Prioritizing the broad enrichment signals from ChIP-Seq data is therefore a prerequisite for such integrated analysis.

Results: Here, we present a method named Clustering-Local-Unique-Enriched-Signals (CLUES), which uses an adaptive-size-window strategy to identify enriched regions (ERs) and clusters them into broad enrichment signals. Tested on 62 ENCODE ChIP-Seq datasets of Ctcf and Nrsf, CLUES performs as well as MACS2 in prioritizing ERs that contain the TF's motif. Tested on 165 ENCODE ChIP-Seq datasets of H3K4me3, H3K27me3, and H3K36me3, CLUES outperforms existing algorithms in prioritizing broad enrichment signals that implicate cell functions influenced by epigenetic regulatory mechanisms. Most importantly, through integrated analysis of prioritized broad enrichment signals of H3K4me3, H3K27me3, Nanog and Oct4, supported by a CRISPR/Cas9 negative selection genetic screen, CLUES helps to confirm several novel regulators of mouse ES cell self-renewal and pluripotency.
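
The general idea of clustering nearby enriched regions into broad signals can be illustrated with a minimal Python sketch. This is not the CLUES implementation: the abstract does not specify the clustering rule, so the fixed gap threshold (`max_gap`), the ranking by width, and the function names `cluster_regions` and `rank_by_width` are all assumptions made for illustration only.

```python
from typing import List, Tuple

# A region is (chromosome, start, end); coordinates are assumed half-open.
Region = Tuple[str, int, int]


def cluster_regions(regions: List[Region], max_gap: int = 3000) -> List[Region]:
    """Merge enriched regions on the same chromosome whose gap is <= max_gap bp.

    Hypothetical rule for illustration; CLUES itself may cluster differently.
    """
    clusters: List[Region] = []
    for chrom, start, end in sorted(regions):
        if clusters:
            last_chrom, last_start, last_end = clusters[-1]
            if chrom == last_chrom and start - last_end <= max_gap:
                # Extend the current cluster to cover this region.
                clusters[-1] = (last_chrom, last_start, max(last_end, end))
                continue
        clusters.append((chrom, start, end))
    return clusters


def rank_by_width(clusters: List[Region]) -> List[Region]:
    """Order clusters from broadest to narrowest (an assumed priority score)."""
    return sorted(clusters, key=lambda r: r[2] - r[1], reverse=True)


if __name__ == "__main__":
    ers = [
        ("chr1", 100, 200),
        ("chr1", 250, 400),
        ("chr1", 10000, 10100),
        ("chr2", 150, 300),
    ]
    for region in rank_by_width(cluster_regions(ers, max_gap=100)):
        print(region)
```

In practice, a broad-signal caller would likely weigh read enrichment as well as width; width alone is used here only to keep the sketch short.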

Conclusions: CLUES holds promise for prioritizing broad enrichment signals from ChIP-Seq data. The download site for CLUES is https://github.com/Wuchao1984/CLUESv1.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • CRISPR-Cas Systems / genetics
  • Cell Self Renewal / genetics*
  • Chromatin Immunoprecipitation
  • Embryonic Stem Cells*
  • Epigenesis, Genetic*
  • Histone Code / genetics
  • Histone-Lysine N-Methyltransferase / genetics*
  • Mice
  • Promoter Regions, Genetic
  • Protein Processing, Post-Translational
  • Regulatory Sequences, Nucleic Acid

Substances

  • Histone-Lysine N-Methyltransferase

Grants and funding

Junling Jia is supported by the Zhejiang Provincial Natural Science Funds (R15C060001, http://www.zjnsf.gov.cn/). Jing Zhu is from Beijing Ming-tian Genetics Ltd. Beijing Ming-tian Genetics Ltd. provided support in the form of salary for author JZ, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific role of this author is articulated in the ‘author contributions’ section.