Comparative Analysis of QSAR-based vs. Chemical Similarity Based Predictors of GPCRs Binding Affinity

Man Luo; Xiang S Wang; Alexander Tropsha

doi:10.1002/minf.201500038

Comparative Analysis of QSAR-based vs. Chemical Similarity Based Predictors of GPCRs Binding Affinity

Mol Inform. 2016 Jan;35(1):36-41. doi: 10.1002/minf.201500038. Epub 2015 Oct 23.

Authors

Man Luo¹, Xiang S Wang², Alexander Tropsha³

Affiliations

¹ Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, CB #7360, Beard Hall, Chapel Hill, NC 27599-7360 USA phone: 919-966-2955; fax: 919-966-0204.
² Department of Pharmaceutical Sciences, College of Pharmacy, Howard University, Washington, District of Columbia 20059.
³ Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, CB #7360, Beard Hall, Chapel Hill, NC 27599-7360 USA phone: 919-966-2955; fax: 919-966-0204. alex_tropsha@unc.edu.

PMID: 27491652
DOI: 10.1002/minf.201500038

Abstract

Ligand based virtual screening (LBVS) approaches could be broadly divided into those relying on chemical similarity searches and those employing Quantitative Structure-Activity Relationship (QSAR) models. We have compared the predictive power of these approaches using some datasets of compounds tested against several G-Protein Coupled Receptors (GPCRs). The k-Nearest Neighbors (kNN) QSAR models were built for known ligands of each GPCR target independently, with a fraction of tested ligands for each target set aside as a validation set. The prediction accuracies of QSAR models for making active/inactive calls for compounds in both training and validation sets were compared to those achieved by the Prediction of Activity Spectra for Substances' (PASS) and the Similarity Ensemble Approach (SEA) tools both available online. Models developed with the kNN QSAR method showed the highest predictive power for almost all tested GPCR datasets. The PASS software, which incorporates multiple end-point specific QSAR models demonstrated a moderate predictive power, while SEA, a chemical similarity based approach, had the lowest prediction power. Our studies suggest that when sufficient amount of data is available to develop and rigorously validate QSAR models such models should be chosen as the preferred virtual screening tool in ligand-based computational drug discovery as compared to chemical similarity based approaches.

Keywords: GPCRs; Model validation; PASS; QSAR modeling; SEA.

Publication types

Comparative Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Binding, Competitive
Combinatorial Chemistry Techniques / methods*
Computational Biology / methods*
Databases, Factual
Drug Discovery / methods
Ligands
Quantitative Structure-Activity Relationship*
Receptors, G-Protein-Coupled / chemistry*
Receptors, G-Protein-Coupled / metabolism
Reproducibility of Results
Small Molecule Libraries / chemistry
Small Molecule Libraries / metabolism

Substances

Ligands
Receptors, G-Protein-Coupled
Small Molecule Libraries