Predicting protein interaction network perturbation by alternative splicing with semi-supervised learning

Cell Rep. 2021 Nov 23;37(8):110045. doi: 10.1016/j.celrep.2021.110045.

Abstract

Alternative splicing introduces an additional layer of protein diversity and complexity in regulating cellular functions that can be specific to the tissue and cell type, physiological state of a cell, or disease phenotype. Recent high-throughput experimental studies have illuminated the functional role of splicing events through rewiring protein-protein interactions; however, the extent to which the macromolecular interactions are affected by alternative splicing has yet to be fully understood. In silico methods provide a fast and cheap alternative to interrogating functional characteristics of thousands of alternatively spliced isoforms. Here, we develop an accurate feature-based machine learning approach that predicts whether a protein-protein interaction carried out by a reference isoform is perturbed by an alternatively spliced isoform. Our method, called the alternatively spliced interactions prediction (ALT-IN) tool, is compared with the state-of-the-art PPI prediction tools and shows superior performance, achieving 0.92 in precision and recall values.

Keywords: alternative splicing; machine learning; protein-protein interactions; semi-supervised learning.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Alternative Splicing / genetics
  • Computational Biology / methods
  • Forecasting / methods*
  • Humans
  • Protein Interaction Mapping / methods*
  • Protein Interaction Maps / genetics
  • Protein Interaction Maps / physiology*
  • Protein Isoforms / analysis
  • Protein Isoforms / metabolism
  • RNA Splicing
  • Supervised Machine Learning

Substances

  • Protein Isoforms