Probabilistic prediction of unknown metabolic and signal-transduction networks

Genetics. 2001 Nov;159(3):1291-8. doi: 10.1093/genetics/159.3.1291.

Abstract

Regulatory networks provide control over complex cell behavior in all kingdoms of life. Here we describe a statistical model, based on representing proteins as collections of domains or motifs, which predicts unknown molecular interactions within these biological networks. Using known protein-protein interactions of Saccharomyces cerevisiae as training data, we were able to predict the links within this network with only 7% false-negative and 10% false-positive error rates. We also use Markov chain Monte Carlo simulation for the prediction of networks with maximum probability under our model. This model can be applied across species, where interaction data from one (or several) species can be used to infer interactions in another. In addition, the model is extensible and can be analogously applied to other molecular data (e.g., DNA sequences).

MeSH terms

  • Amino Acid Motifs
  • Genetic Techniques*
  • Models, Genetic*
  • Models, Statistical
  • Monte Carlo Method
  • Plants / genetics
  • Protein Binding
  • Protein Structure, Tertiary
  • Saccharomyces cerevisiae / genetics*
  • Sequence Analysis, DNA
  • Signal Transduction*
  • Software