Rapid diagnosis and recurrence prediction of choledocholithiasis disease using raw bile with machine learning assisted SERS

Talanta. 2025 Jan 1:282:126979. doi: 10.1016/j.talanta.2024.126979. Epub 2024 Oct 9.

Abstract

Surface-enhanced Raman spectroscopy (SERS) analysis based on body fluids has been widely applied in disease diagnose. Choledocholithiasis is a widespread and often recurrent digestive system disease, with limited data on factors predicting its formation and reappearance. Bile contains many components that could provide valuable diagnostic information; however, the current diagnosis of biliary disease by SERS focuses on detecting specific component in the bile, overlooking the complex interplay and correlations among multiple factors that could be crucial for accurate diagnosis. This work directly obtained multi-component SERS spectral information of raw bile from 46 patients. Characteristic information was extracted from bile SERS spectra using Principal Component Analysis (PCA), revealing variations in the content of characteristic components associated with different choledocholithiasis types and their recurrence frequency. Pearson correlation analysis was also introduced to reveal the interactions of primary active substances pertinent to choledocholithiasis diagnosis. The efficacy of PCA and Support Vector Machine (SVM) models in classifying stone types, presented an accuracy of 99.2 %. Furthermore, the interaction patterns among SERS characteristic components in choledocholithiasis recurrence frequency were revealed, and with the support of SVM, the prediction for different recurrence rates reached an accuracy of 95.2 %. Overall, this work demonstrates that integrating SERS with machine learning can support disease diagnosis and the interpretation of correlations among multiple components, facilitating elucidating the disease mechanisms.

Keywords: Bile; Choledocholithiasis; Correlation; Disease; Machine learning; SERS.

MeSH terms

  • Aged
  • Bile* / chemistry
  • Choledocholithiasis* / diagnosis
  • Female
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • Principal Component Analysis
  • Recurrence*
  • Spectrum Analysis, Raman* / methods
  • Support Vector Machine