An integrated pipeline for the genome-wide analysis of transcription factor binding sites from ChIP-Seq

PLoS One. 2011 Feb 16;6(2):e16432. doi: 10.1371/journal.pone.0016432.

Abstract

ChIP-Seq has become the standard method for genome-wide profiling DNA association of transcription factors. To simplify analyzing and interpreting ChIP-Seq data, which typically involves using multiple applications, we describe an integrated, open source, R-based analysis pipeline. The pipeline addresses data input, peak detection, sequence and motif analysis, visualization, and data export, and can readily be extended via other R and Bioconductor packages. Using a standard multicore computer, it can be used with datasets consisting of tens of thousands of enriched regions. We demonstrate its effectiveness on published human ChIP-Seq datasets for FOXA1, ER, CTCF and STAT1, where it detected co-occurring motifs that were consistent with the literature but not detected by other methods. Our pipeline provides the first complete set of Bioconductor tools for sequence and motif analysis of ChIP-Seq and ChIP-chip data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Base Sequence
  • Binding Sites
  • CCCTC-Binding Factor
  • Chromatin / chemistry
  • Chromatin / metabolism*
  • Chromatin Immunoprecipitation / methods*
  • Chromosome Mapping / methods*
  • HeLa Cells
  • Hepatocyte Nuclear Factor 3-alpha / metabolism
  • Humans
  • Molecular Sequence Data
  • Protein Binding
  • Repressor Proteins / metabolism
  • STAT1 Transcription Factor / metabolism
  • Sequence Analysis, DNA / methods
  • Sequence Homology
  • Systems Integration
  • Transcription Factors / metabolism*
  • Tumor Cells, Cultured

Substances

  • CCCTC-Binding Factor
  • CTCF protein, human
  • Chromatin
  • FOXA1 protein, human
  • Hepatocyte Nuclear Factor 3-alpha
  • Repressor Proteins
  • STAT1 Transcription Factor
  • Transcription Factors