Negative information for motif discovery

Pac Symp Biocomput. 2004:360-71. doi: 10.1142/9789812704856_0034.

Abstract

We discuss a method of combining genome-wide transcription factor binding data, gene expression data, and genome sequence data for the purpose of motif discovery in S. cerevisiae. Within the word-counting algorithmic approach to motif discovery, we present a method of incorporating information from negative intergenic regions where a transcription factor is thought not to bind, and a statistical significance measure which account for intergenic regions of different lengths. Our results demonstrate that our method performs slightly better than other motif discovery algorithms. Finally, we present significant potential new motifs discovered by the algorithm.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Base Sequence
  • Binding Sites
  • Computational Biology*
  • Consensus Sequence
  • DNA, Fungal / genetics
  • DNA, Fungal / metabolism
  • DNA, Intergenic
  • Models, Genetic
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / metabolism
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism*

Substances

  • DNA, Fungal
  • DNA, Intergenic
  • Saccharomyces cerevisiae Proteins
  • Transcription Factors