Distinguishing direct versus indirect transcription factor-DNA interactions

Genome Res. 2009 Nov;19(11):2090-100. doi: 10.1101/gr.094144.109. Epub 2009 Aug 3.

Abstract

Transcriptional regulation is largely enacted by transcription factors (TFs) binding DNA. Large numbers of TF binding motifs have been revealed by ChIP-chip experiments followed by computational DNA motif discovery. However, the success of motif discovery algorithms has been limited when applied to sequences bound in vivo (such as those identified by ChIP-chip) because the observed TF-DNA interactions are not necessarily direct: Some TFs predominantly associate with DNA indirectly through protein partners, while others exhibit both direct and indirect binding. Here, we present the first method for distinguishing between direct and indirect TF-DNA interactions, integrating in vivo TF binding data, in vivo nucleosome occupancy data, and motifs from in vitro protein binding microarray experiments. When applied to yeast ChIP-chip data, our method reveals that only 48% of the data sets can be readily explained by direct binding of the profiled TF, while 16% can be explained by indirect DNA binding. In the remaining 36%, none of the motifs used in our analysis was able to explain the ChIP-chip data, either because the data were too noisy or because the set of motifs was incomplete. As more in vitro TF DNA binding motifs become available, our method could be used to build a complete catalog of direct and indirect TF-DNA interactions. Our method is not restricted to yeast or to ChIP-chip data, but can be applied in any system for which both in vivo binding data and in vitro DNA binding motifs are available.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Base Sequence
  • Binding Sites / genetics
  • Chromatin Immunoprecipitation
  • DNA, Fungal / metabolism*
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism
  • Forkhead Transcription Factors
  • Models, Biological
  • Nucleosomes / metabolism
  • Protein Binding
  • Protein Interaction Domains and Motifs / genetics
  • Protein Interaction Mapping / methods
  • Reproducibility of Results
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / metabolism*
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / metabolism*
  • Shelterin Complex
  • Telomere-Binding Proteins / genetics
  • Telomere-Binding Proteins / metabolism
  • Transcription Factors / genetics
  • Transcription Factors / metabolism*

Substances

  • DNA, Fungal
  • DNA-Binding Proteins
  • FHL1 protein, S cerevisiae
  • Forkhead Transcription Factors
  • Nucleosomes
  • RAP1 protein, S cerevisiae
  • SFP1 protein, S cerevisiae
  • STE12 protein, S cerevisiae
  • Saccharomyces cerevisiae Proteins
  • Shelterin Complex
  • TEC1 protein, S cerevisiae
  • Telomere-Binding Proteins
  • Transcription Factors