Splice-Junction-Based Mapping of Alternative Isoforms in the Human Proteome

Cell Rep. 2019 Dec 10;29(11):3751-3765.e5. doi: 10.1016/j.celrep.2019.11.026.

Abstract

The protein-level translational status and function of many alternative splicing events remain poorly understood. We use an RNA sequencing (RNA-seq)-guided proteomics method to identify protein alternative splicing isoforms in the human proteome by constructing tissue-specific protein databases that prioritize transcript splice junction pairs with high translational potential. Using the custom databases to reanalyze ∼80 million mass spectra in public proteomics datasets, we identify more than 1,500 noncanonical protein isoforms across 12 human tissues, including ∼400 sequences undocumented on TrEMBL and RefSeq databases. We apply the method to original quantitative mass spectrometry experiments and observe widespread isoform regulation during human induced pluripotent stem cell cardiomyocyte differentiation. On a proteome scale, alternative isoform regions overlap frequently with disordered sequences and post-translational modification sites, suggesting that alternative splicing may regulate protein function through modulating intrinsically disordered regions. The described approach may help elucidate functional consequences of alternative splicing and expand the scope of proteomics investigations in various systems.

Keywords: alternative splicing; cardiomyocyte differentiation; human proteome; induced pluripotent stem cells; intrinsically disordered region; mass spectrometry; protein isoforms; proteoforms; proteomics; splice isoforms.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Alternative Splicing*
  • Cell Differentiation
  • Cell Line
  • Female
  • Humans
  • Induced Pluripotent Stem Cells / cytology
  • Induced Pluripotent Stem Cells / metabolism
  • Male
  • Mass Spectrometry / methods
  • Myocytes, Cardiac / cytology
  • Myocytes, Cardiac / metabolism
  • Protein Isoforms / genetics
  • Protein Isoforms / metabolism
  • Proteome / genetics*
  • Proteome / metabolism
  • Proteomics / methods*
  • RNA-Seq / methods

Substances

  • Protein Isoforms
  • Proteome