High-accuracy peptide mass fingerprinting using peak intensity data with machine learning

J Proteome Res. 2008 Jan;7(1):62-9. doi: 10.1021/pr070088g. Epub 2007 Oct 3.

Abstract

For MALDI-TOF mass spectrometry, we show that the intensity of a peptide-ion peak is directly correlated with its sequence, with the residues M, H, P, R, and L having the most substantial effect on ionization. We developed a machine learning approach that exploits this relationship to significantly improve peptide mass fingerprint (PMF) accuracy based on training data sets from both true-positive and false-positive PMF searches. The model's cross-validated accuracy in distinguishing real versus false-positive database search results is 91%, rivaling the accuracy of MS/MS-based protein identification.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Artificial Intelligence*
  • Data Interpretation, Statistical
  • Peptide Mapping / methods*
  • Proteins / analysis
  • Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization
  • Tandem Mass Spectrometry / methods*

Substances

  • Proteins