Negative information for motif discovery

K T Takusagawa; D K Gifford

doi:10.1142/9789812704856_0034

Negative information for motif discovery

Pac Symp Biocomput. 2004:360-71. doi: 10.1142/9789812704856_0034.

Authors

K T Takusagawa¹, D K Gifford

Affiliation

¹ Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA. kenta@mit.edu

PMID: 14992517
DOI: 10.1142/9789812704856_0034

Abstract

We discuss a method of combining genome-wide transcription factor binding data, gene expression data, and genome sequence data for the purpose of motif discovery in S. cerevisiae. Within the word-counting algorithmic approach to motif discovery, we present a method of incorporating information from negative intergenic regions where a transcription factor is thought not to bind, and a statistical significance measure which account for intergenic regions of different lengths. Our results demonstrate that our method performs slightly better than other motif discovery algorithms. Finally, we present significant potential new motifs discovered by the algorithm.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Base Sequence
Binding Sites
Computational Biology*
Consensus Sequence
DNA, Fungal / genetics
DNA, Fungal / metabolism
DNA, Intergenic
Models, Genetic
Saccharomyces cerevisiae Proteins / genetics
Saccharomyces cerevisiae Proteins / metabolism
Transcription Factors / genetics*
Transcription Factors / metabolism*

Substances

DNA, Fungal
DNA, Intergenic
Saccharomyces cerevisiae Proteins
Transcription Factors