A method for predicting protein-protein interaction types

PLoS One. 2014 Mar 13;9(3):e90904. doi: 10.1371/journal.pone.0090904. eCollection 2014.

Abstract

Protein-protein interactions (PPIs) govern basic cellular processes through signal transduction and complex formation. The diversity of those processes gives rise to a remarkable diversity of interactions types, ranging from transient phosphorylation interactions to stable covalent bonding. Despite our increasing knowledge on PPIs in humans and other species, their types remain relatively unexplored and few annotations of types exist in public databases. Here, we propose the first method for systematic prediction of PPI type based solely on the techniques by which the interaction was detected. We show that different detection methods are better suited for detecting specific types. We apply our method to ten interaction types on a large scale human PPI dataset. We evaluate the performance of the method using both internal cross validation and external data sources. In cross validation, we obtain an area under receiver operating characteristic (ROC) curve ranging from 0.65 to 0.97 with an average of 0.84 across the predicted types. Comparing the predicted interaction types to external data sources, we obtained significant agreements for phosphorylation and ubiquitination interactions, with hypergeometric p-value = 2.3e(-54) and 5.6e(-28) respectively. We examine the biological relevance of our predictions using known signaling pathways and chart the abundance of interaction types in cell processes. Finally, we investigate the cross-relations between different interaction types within the network and characterize the discovered patterns, or motifs. We expect the resulting annotated network to facilitate the reconstruction of process-specific subnetworks and assist in predicting protein function or interaction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Motifs
  • Area Under Curve
  • Computational Biology / methods*
  • Databases, Protein
  • Humans
  • Logistic Models
  • Phosphorylation
  • Protein Interaction Mapping / methods*
  • Proteins / metabolism
  • ROC Curve
  • Regression Analysis
  • Signal Transduction
  • Software
  • Ubiquitination

Substances

  • Proteins

Grants and funding

YS was supported in part by a fellowship from the Edmond J. Safra Center for Bioinformatics at Tel Aviv University. RS was supported by an I-CORE Program of the Planning and Budgeting Committee and The Israel Science Foundation (grant no. 757/12). MK was supported by the Israel Science Foundation, the Israel Cancer Research Fund and the Israel Cancer Association. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.