Predictive Value of Sequential Organ Failure Assessment Score across Patients with and without COVID-19 Infection

Ann Am Thorac Soc. 2022 May;19(5):790-798. doi: 10.1513/AnnalsATS.202106-680OC.

Abstract

Rationale: Sequential organ failure assessment (SOFA) scores are commonly used in crisis standards of care policies to assist in resource allocation. The relative predictive value of SOFA by coronavirus disease (COVID-19) infection status and among racial and ethnic subgroups within patients infected with COVID-19 is unknown.

Objectives: To evaluate the accuracy and calibration of SOFA in predicting hospital mortality by COVID-19 infection status and across racial and ethnic subgroups.

Methods: We performed a retrospective cohort study of adult admissions to the University of Miami Hospital and Clinics inpatient wards (July 1, 2020-April 1, 2021). We primarily considered maximum SOFA within 48 hours of hospitalization. We assessed accuracy using the area under the receiver operating characteristic curve (AUROC) and created calibration belts. Considered subgroups were defined by COVID-19 infection status (by severe acute respiratory syndrome coronavirus 2 polymerase chain reaction testing) and prevalent racial and ethnic minorities. Comparisons across subgroups were made with DeLong testing for discriminative accuracy and visualization of calibration belts.

Results: Our primary cohort consisted of 20,045 hospitalizations, of which 1,894 (9.5%) were COVID-19 positive. SOFA was similarly accurate for COVID-19-positive (AUROC, 0.835) and COVID-19-negative (AUROC, 0.810; P = 0.15) admissions but was slightly better calibrated in patients who were positive for COVID-19. For those with critical illness, maximum SOFA score accuracy at critical illness onset also did not differ by COVID-19 status (AUROC, COVID-19 positive vs. negative: intensive care unit admissions, 0.751 vs. 0.775; P = 0.46; mechanically ventilated, 0.713 vs. 0.792, P = 0.13), and calibration was again better for patients positive for COVID-19. Among patients with COVID-19, SOFA accuracy was similar between the non-Hispanic White population (AUROC, 0.894) and racial and ethnic minorities (Hispanic White population: AUROC, 0.824 [P vs. non-Hispanic White = 0.05]; non-Hispanic Black population: AUROC, 0.800 [P = 0.12]; Hispanic Black population: AUROC, 0.948 [P = 0.31]). This similar accuracy was also found for those without COVID-19 (non-Hispanic White population: AUROC, 0.829; Hispanic White population: AUROC, 0.811 [P = 0.37]; Hispanic Black population: AUROC, 0.828 [P = 0.97]; non-Hispanic Black population: AUROC, 0.867 [P = 0.46]). SOFA was well calibrated for all racial and ethnic groups with COVID-19 but estimated mortality more variably and performed less well across races and ethnicities without COVID-19.

Conclusions: SOFA accuracy does not differ by COVID-19 status and is similar among racial and ethnic groups both with and without COVID-19. Calibration is better for COVID-19-infected patients and, among those without COVID-19, varies by race and ethnicity.
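
As a minimal illustration of the discrimination metric named in the Methods, the sketch below computes AUROC as the probability that a randomly chosen nonsurvivor has a higher score than a randomly chosen survivor, counting ties as one half. The function name `auroc` and the six toy score/outcome pairs are hypothetical and are not study data. The study's own software, its DeLong comparisons, and its calibration belts are not reproduced here.

```python
def auroc(scores, died):
    """Rank-based AUROC: P(score of a death > score of a survivor), ties = 0.5.

    scores: risk scores (for example, maximum SOFA), one per admission
    died:   matching outcomes, truthy for in-hospital death
    """
    if len(scores) != len(died):
        raise ValueError("scores and died must have the same length")

    deaths = [s for s, d in zip(scores, died) if d]
    survivors = [s for s, d in zip(scores, died) if not d]
    if not deaths or not survivors:
        raise ValueError("AUROC needs at least one death and one survivor")

    # Compare every death with every survivor (fine for small examples;
    # a sorted-rank formulation would be preferable for large cohorts).
    wins = 0.0
    for d_score in deaths:
        for s_score in survivors:
            if d_score > s_score:
                wins += 1.0
            elif d_score == s_score:
                wins += 0.5
    return wins / (len(deaths) * len(survivors))


# Hypothetical toy data, NOT taken from the study.
toy_scores = [2, 4, 4, 7, 9, 11]
toy_died = [0, 0, 1, 0, 1, 1]
toy_auc = auroc(toy_scores, toy_died)
```

An AUROC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of deaths from survivors. AUROC does not describe calibration, which is why the abstract reports the two properties separately.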

Keywords: COVID-19; calibration; ethnic groups; organ dysfunction scores; race factors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • COVID-19*
  • Critical Illness
  • Hospital Mortality
  • Humans
  • Organ Dysfunction Scores*
  • Retrospective Studies