Improving the sensitivity of MASCOT search results validation by combining new features with Bayesian nonparametric model

Proteomics. 2010 Dec;10(23):4293-300. doi: 10.1002/pmic.200900668.

Abstract

The probability-based search engine MASCOT has been widely used to identify peptides and proteins in shotgun proteomic research. Most subsequent quality control methods filter out ambiguous assignments according to the ion score and thresholds provided by MASCOT. On the basis of target-decoy database search strategy, we evaluated the performance of several filter methods on MASCOT search results and demonstrated that using filter boundaries on two-dimensional feature spaces, the MASCOT ion score and its relative score can improve the sensitivity of the filter process. Furthermore, using a linear combination of several characteristics of the assigned peptides, including the MASCOT scores, 15 previously employed features, and some newly introduced features, we applied a Bayesian nonparametric model to MASCOT search results and validated more correctly identified peptides in control and complex data sets than those could be validated by empirical score thresholds.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem*
  • Discriminant Analysis
  • Humans
  • Linear Models
  • Search Engine / methods*
  • Search Engine / statistics & numerical data
  • Statistics, Nonparametric*