Evaluation of algorithms for protein identification from sequence databases using mass spectrometry data

Daniel C Chamrad; Gerhard Körting; Kai Stühler; Helmut E Meyer; Joachim Klose; Martin Blüggel

doi:10.1002/pmic.200300612

Evaluation of algorithms for protein identification from sequence databases using mass spectrometry data

Proteomics. 2004 Mar;4(3):619-28. doi: 10.1002/pmic.200300612.

Authors

Daniel C Chamrad¹, Gerhard Körting, Kai Stühler, Helmut E Meyer, Joachim Klose, Martin Blüggel

Affiliation

¹ Protagen, Dortmund, Germany.

PMID: 14997485
DOI: 10.1002/pmic.200300612

Abstract

In this work, the commonly used algorithms for mass spectrometry based protein identification, Mascot, MS-Fit, ProFound and SEQUEST, were studied in respect to the selectivity and sensitivity of their searches. The influence of various search parameters were also investigated. Approximately 6600 searches were performed using different search engines with several search parameters to establish a statistical basis. The applied mass spectrometric data set was chosen from a current proteome study. The huge amount of data could only be handled with computational assistance. We present a software solution for fully automated triggering of several peptide mass fingerprinting (PMF) and peptide fragmentation fingerprinting (PFF) algorithms. The development of this high-throughput method made an intensive evaluation based on data acquired in a typical proteome project possible. Previous evaluations of PMF and PFF algorithms were mainly based on simulations.

MeSH terms

Algorithms
Animals
Binding Sites
Computational Biology
Databases as Topic*
Ions
Mass Spectrometry / methods*
Mice
Proteome
Proteomics / methods*
Software

Substances

Ions
Proteome