vScreenML v2.0: Improved Machine Learning Classification for Reducing False Positives in Structure-Based Virtual Screening

Grigorii V Andrianov; Emeline Haroldsen; John Karanicolas

doi:10.3390/ijms252212350

vScreenML v2.0: Improved Machine Learning Classification for Reducing False Positives in Structure-Based Virtual Screening

Int J Mol Sci. 2024 Nov 18;25(22):12350. doi: 10.3390/ijms252212350.

Authors

Grigorii V Andrianov^{1

2}, Emeline Haroldsen¹, John Karanicolas^{1

3}

Affiliations

¹ Cancer Signaling & Microenvironment Program, Fox Chase Cancer Center, Philadelphia, PA 19111, USA.
² Institute of Fundamental Medicine and Biology, Kazan Federal University, Kazan 420008, Russia.
³ Moulder Center for Drug Discovery Research, Temple University School of Pharmacy, Philadelphia, PA 19140, USA.

Abstract

The enthusiastic adoption of make-on-demand chemical libraries for virtual screening has highlighted the need for methods that deliver improved hit-finding discovery rates. Traditional virtual screening methods are often inaccurate, with most compounds nominated in a virtual screen not engaging the intended target protein to any detectable extent. Emerging machine learning approaches have made significant progress in this regard, including our previously described tool vScreenML. The broad adoption of vScreenML was hindered by its challenging usability and dependencies on certain obsolete or proprietary software packages. Here, we introduce vScreenML 2.0 to address each of these limitations with a streamlined Python implementation. Through careful benchmarks, we show that vScreenML 2.0 outperforms other widely used tools for virtual screening hit discovery.

Keywords: drug discovery; machine learning; virtual screening.

MeSH terms

Drug Discovery / methods
Drug Evaluation, Preclinical / methods
Humans
Machine Learning*
Small Molecule Libraries / chemistry
Software*
User-Computer Interface

Substances

Small Molecule Libraries

Abstract

MeSH terms

Substances

Grants and funding