Assessing computational methods for predicting protein stability upon mutation: good on average but not in the details

Protein Eng Des Sel. 2009 Sep;22(9):553-60. doi: 10.1093/protein/gzp030. Epub 2009 Jun 26.

Abstract

Methods for protein modeling and design advanced rapidly in recent years. At the heart of these computational methods is an energy function that calculates the free energy of the system. Many of these functions were also developed to estimate the consequence of mutation on protein stability or binding affinity. In the current study, we chose six different methods that were previously reported as being able to predict the change in protein stability (DeltaDeltaG) upon mutation: CC/PBSA, EGAD, FoldX, I-Mutant2.0, Rosetta and Hunter. We evaluated their performance on a large set of 2156 single mutations, avoiding for each program the mutations used for training. The correlation coefficients between experimental and predicted DeltaDeltaG values were in the range of 0.59 for the best and 0.26 for the worst performing method. All the tested computational methods showed a correct trend in their predictions, but failed in providing the precise values. This is not due to lack in precision of the experimental data, which showed a correlation coefficient of 0.86 between different measurements. Combining the methods did not significantly improve prediction accuracy compared to a single method. These results suggest that there is still room for improvement, which is crucial if we want forcefields to perform better in their various tasks.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Databases, Protein
  • Linear Models
  • Mutation*
  • Protein Engineering / methods*
  • Protein Stability*
  • Proteins / chemistry*
  • Proteins / genetics*
  • Thermodynamics

Substances

  • Proteins