Introducing the CSP Analyzer: A novel Machine Learning-based application for automated analysis of two-dimensional NMR spectra in NMR fragment-based screening

Comput Struct Biotechnol J. 2020 Feb 28:18:603-611. doi: 10.1016/j.csbj.2020.02.015. eCollection 2020.

Abstract

NMR-based screening, especially fragment-based drug discovery is a valuable approach in early-stage drug discovery. Monitoring fragment-binding in protein-detected 2D NMR experiments requires analysis of hundreds of spectra to detect chemical shift perturbations (CSPs) in the presence of ligands screened. Computational tools are available that simplify the tracking of CSPs in 2D NMR spectra. However, to the best of our knowledge, an efficient automated tool for the assessment and binning of multiple spectra for ligand binding has not yet been described. We present a novel and fast approach for analysis of multiple 2D HSQC spectra based on machine-learning-driven statistical discrimination. The CSP Analyzer features a C# frontend interfaced to a Python ML classifier. The software allows rapid evaluation of 2D screening data from large number of spectra, reducing user-introduced bias in the evaluation. The CSP Analyzer software package is available on GitHub https://github.com/rubbs14/CSP-Analyzer/releases/tag/v1.0 under the GPL license 3.0 and is free to use for academic and commercial uses.

Keywords: 2-D NMR; Automatic CSP analysis; C# GUI; Fragment screening; Fragment-based drug discovery; Machine-learning.