A Multianalyte Machine Learning Model to Detect Wrong Blood in Complete Blood Count Tube Errors in a Pediatric Setting

Brendan V Graham; Stephen R Master; Amrom E Obstfeld; Robert B Wilson

doi:10.1093/clinchem/hvae210

Clin Chem. 2025 Jan 11:hvae210. doi: 10.1093/clinchem/hvae210. Online ahead of print.

Authors

Brendan V Graham¹, Stephen R Master¹,², Amrom E Obstfeld¹,², Robert B Wilson¹,²

Affiliations

¹ Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, United States.
² Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, United States.

PMID: 39797417
DOI: 10.1093/clinchem/hvae210

Abstract

Background: Multianalyte machine learning (ML) models can potentially identify previously undetectable wrong blood in tube (WBIT) errors, improving upon current single-analyte delta check methodology. However, WBIT detection model performance has not been assessed in a real-world, low-prevalence context. To estimate real-world positive predictive values, we propose a methodology to assess WBIT detection models by evaluating the impact of missing data and by using a "low prevalence" validation data set.

Methods: We trained a range of model specifications using various predictors in a pediatric setting. We assessed the top-performing model on a modified, "low prevalence" validation data set across a range of probability thresholds. Model performance was also compared to a pre-positive patient identification (pre-PPID) data set.
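The "low prevalence" evaluation described in the Methods can be illustrated with a minimal sketch. It assumes the validation set holds binary labels (1 = WBIT, 0 = correctly labeled tube) and model probability scores; the function names, the target prevalence, and the sampling scheme are hypothetical choices for illustration and are not taken from the article.

```python
import random

def downsample_to_prevalence(labels, scores, prevalence, seed=0):
    """Keep every negative and a random subset of positives so that
    positives make up roughly `prevalence` of the returned set.
    (Hypothetical helper; the article's exact procedure is not given here.)"""
    rng = random.Random(seed)
    pairs = list(zip(labels, scores))
    neg = [p for p in pairs if p[0] == 0]
    pos = [p for p in pairs if p[0] == 1]
    # Solve n_pos / (n_pos + n_neg) = prevalence for n_pos.
    n_pos = round(prevalence * len(neg) / (1 - prevalence))
    n_pos = min(max(n_pos, 1), len(pos))
    kept = neg + rng.sample(pos, n_pos)
    rng.shuffle(kept)
    return [y for y, _ in kept], [s for _, s in kept]

def ppv_at_threshold(labels, scores, threshold):
    """Positive predictive value when scores >= threshold are flagged.
    Returns None if nothing is flagged."""
    tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
    flagged = tp + fp
    return tp / flagged if flagged else None

def ppv_sweep(labels, scores, thresholds):
    """PPV at each probability threshold, as a dict."""
    return {t: ppv_at_threshold(labels, scores, t) for t in thresholds}
```

Calling `ppv_sweep` on the downsampled labels and scores with a grid such as `[0.5, 0.9, 0.99]` gives one PPV estimate per threshold, which is the kind of range the abstract reports.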

Results: An Extreme Gradient Boosting (XGBoost) model with minimal preprocessing performed the best for both complete blood count with differential white cell count (CBC with Diff) tests (accuracy 0.9715) and complete blood count without differential white cell count (CBC without Diff) tests (accuracy 0.9647). Assessment on a downsampled, "low prevalence" validation data set resulted in estimated positive predictive values ranging from 0.01 to 0.67 (CBC with Diff) and 0.01 to 0.75 (CBC without Diff), depending on the probability threshold chosen. A comparison of prospective performance to PPID data demonstrated a large decrease in estimated WBIT errors.
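The wide spread in positive predictive value follows from Bayes' rule: for a fixed sensitivity and specificity, PPV = (sensitivity × prevalence) / (sensitivity × prevalence + (1 − specificity) × (1 − prevalence)), so PPV drops sharply as prevalence falls. The sketch below works this through; the sensitivity, specificity, and prevalence values are hypothetical and are not figures from the article.

```python
def ppv(sensitivity, specificity, prevalence):
    """Positive predictive value from Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)

# Hypothetical operating point, chosen only for illustration.
SENS, SPEC = 0.97, 0.97

balanced = ppv(SENS, SPEC, 0.5)     # balanced evaluation set
rare = ppv(SENS, SPEC, 0.001)       # assumed low real-world prevalence

# Raising the probability threshold typically trades sensitivity for
# specificity; a stricter (hypothetical) operating point raises PPV.
rare_strict = ppv(0.60, 0.9999, 0.001)
```

Under these assumed numbers, the same classifier that looks accurate on a balanced set flags mostly false positives at a prevalence of 0.001, while the stricter operating point recovers a much higher PPV at the cost of missed errors.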

Conclusions: We find that ML models can accurately predict WBITs in a primarily pediatric setting. Evaluating model performance across a range of probability thresholds minimizes the number of false positives while still providing added safety benefits. The decrease in estimated WBITs post-PPID implementation shows the potential safety benefits of a WBIT model for hospitals not using PPID when collecting laboratory specimens.

© Association for Diagnostics & Laboratory Medicine 2025. All rights reserved.