The identification of major histocompatibility complex (MHC)-binding peptides in mass spectrometry (MS)-based immunopeptideomics relies largely on database search engines developed for proteomics data analysis. However, because immunopeptidomics experiments do not involve enzymatic digestion at specific residues, an inflated search space leads to a high false positive rate and low sensitivity in peptide identification. In order to improve the sensitivity and reliability of peptide identification, a post-processing tool named DeepRescore is developed. DeepRescore combines peptide features derived from deep learning predictions, namely accurate retention timeand MS/MS spectra predictions, with previously used features to rescore peptide-spectrum matches. Using two public immunopeptidomics datasets, it is shown that rescoring by DeepRescore increases both the sensitivity and reliability of MHC-binding peptide and neoantigen identifications compared to existing methods. It is also shown that the performance improvement is, to a large extent, driven by the deep learning-derived features. DeepRescore is developed using NextFlow and Docker and is available at https://github.com/bzhanglab/DeepRescore.
Keywords: bioinformatics; deep learning; immunopeptidomics; proteomics; retention time.
© 2020 Wiley-VCH GmbH.