Prospective and External Validation of an Ensemble Learning Approach to Sensitively Detect Intravenous Fluid Contamination in Basic Metabolic Panels

Nicholas C Spies; Leah Militello; Christopher W Farnsworth; Joe M El-Khoury; Thomas J S Durant; Mark A Zaydman

doi:10.1093/clinchem/hvae168

Prospective and External Validation of an Ensemble Learning Approach to Sensitively Detect Intravenous Fluid Contamination in Basic Metabolic Panels

Clin Chem. 2024 Nov 15:hvae168. doi: 10.1093/clinchem/hvae168. Online ahead of print.

Authors

Nicholas C Spies^{1

2

3}, Leah Militello⁴, Christopher W Farnsworth¹, Joe M El-Khoury⁴, Thomas J S Durant⁴, Mark A Zaydman¹

Affiliations

¹ Department of Pathology, Washington University in St. Louis School of Medicine, St. Louis, MO, United States.
² Division of Research and Innovation, ARUP Laboratories, Salt Lake City, UT, United States.
³ Department of Pathology, University of Utah Health, Salt Lake City, UT, United States.
⁴ Department of Laboratory Medicine, Yale School of Medicine, New Haven, CT, United States.

PMID: 39545815
DOI: 10.1093/clinchem/hvae168

Abstract

Background: Intravenous (IV) fluid contamination within clinical specimens causes an operational burden on the laboratory when detected, and potential patient harm when undetected. Even mild contamination is often sufficient to meaningfully alter results across multiple analytes. A recently reported unsupervised learning approach was more sensitive than routine workflows, but still lacked sensitivity to mild but significant contamination. Here, we leverage ensemble learning to more sensitively detect contaminated results using an approach which is explainable and generalizable across institutions.

Methods: An ensemble-based machine learning pipeline of general and fluid-specific models was trained on real-world and simulated contamination and internally and externally validated. Benchmarks for performance assessment were derived from in silico simulations, in vitro experiments, and expert review. Fluid-specific regression models estimated contamination severity. SHapley Additive exPlanation (SHAP) values were calculated to explain specimen-level predictions, and algorithmic fairness was evaluated by comparing flag rates across demographic and clinical subgroups.

Results: The sensitivities, specificities, and Matthews correlation coefficients were 0.858, 0.993, and 0.747 for the internal validation set, and 1.00, 0.980, and 0.387 for the external set. SHAP values provided plausible explanations for dextrose- and ketoacidosis-related hyperglycemia. Flag rates from the pipeline were higher than the current workflow, with improved detection of contamination events expected to exceed allowable limits for measurement error and reference change values.

Conclusions: An accurate, generalizable, and explainable ensemble-based machine learning pipeline was developed and validated for sensitively detecting IV fluid contamination. Implementing this pipeline would help identify errors that are poorly detected by current clinical workflows and a previously described unsupervised machine learning-based method.

© Association for Diagnostics & Laboratory Medicine 2024. All rights reserved. For commercial re-use, please contact reprints@oup.com for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact journals.permissions@oup.com.