Pocketome via comprehensive identification and classification of ligand binding envelopes

Mol Cell Proteomics. 2005 Jun;4(6):752-61. doi: 10.1074/mcp.M400159-MCP200. Epub 2005 Mar 9.

Abstract

We developed a new computational algorithm for the accurate identification of ligand binding envelopes rather than surface binding sites. We performed a large scale classification of the identified envelopes according to their shape and physicochemical properties. The predicting algorithm, called PocketFinder, uses a transformation of the Lennard-Jones potential calculated from a three-dimensional protein structure and does not require any knowledge about a potential ligand molecule. We validated this algorithm using two systematically collected data sets of ligand binding pockets from complexed (bound) and uncomplexed (apo) structures from the Protein Data Bank, 5616 and 11,510, respectively. As many as 96.8% of experimental binding sites were predicted at better than 50% overlap level. Furthermore 95.0% of the asserted sites from the apo receptors were predicted at the same level. We demonstrate that conformational differences between the apo and bound pockets do not dramatically affect the prediction results. The algorithm can be used to predict ligand binding pockets of uncharacterized protein structures, suggest new allosteric pockets, evaluate feasibility of protein-protein interaction inhibition, and prioritize molecular targets. Finally the data base of the known and predicted binding pockets for the human proteome structures, the human pocketome, was collected and classified. The pocketome can be used for rapid evaluation of possible binding partners of a given chemical compound.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Binding Sites
  • Cluster Analysis
  • Computational Biology / methods*
  • Databases, Factual
  • Ligands
  • Models, Chemical
  • Models, Molecular*
  • Protein Conformation*
  • Proteins / chemistry*
  • Sequence Homology
  • Structure-Activity Relationship

Substances

  • Ligands
  • Proteins