EHR-based Case Identification of Pediatric Long COVID: A Report from the RECOVER EHR Cohort

Morgan Botdorf; Kimberley Dickinson; Vitaly Lorman; Hanieh Razzaghi; Nicole Marchesani; Suchitra Rao; Colin Rogerson; Miranda Higginbotham; Asuncion Mejias; Daria Salyakina; Deepika Thacker; Dima Dandachi; Dimitri A Christakis; Emily Taylor; Hayden Schwenk; Hiroki Morizono; Jonathan Cogen; Nathan M Pajor; Ravi Jhaveri; Christopher B Forrest; L Charles Bailey

doi:10.1101/2024.05.23.24307492

EHR-based Case Identification of Pediatric Long COVID: A Report from the RECOVER EHR Cohort

medRxiv [Preprint]. 2024 Aug 26:2024.05.23.24307492. doi: 10.1101/2024.05.23.24307492.

Authors

Morgan Botdorf¹, Kimberley Dickinson¹, Vitaly Lorman¹, Hanieh Razzaghi¹, Nicole Marchesani¹, Suchitra Rao², Colin Rogerson³, Miranda Higginbotham¹, Asuncion Mejias⁴, Daria Salyakina⁵, Deepika Thacker⁶, Dima Dandachi⁷, Dimitri A Christakis⁸, Emily Taylor⁹, Hayden Schwenk¹⁰, Hiroki Morizono¹¹, Jonathan Cogen¹², Nathan M Pajor¹³, Ravi Jhaveri¹⁴, Christopher B Forrest¹, L Charles Bailey¹

Affiliations

¹ Applied Clinical Research Center, Children's Hospital of Philadelphia, Philadelphia, PA.
² Department of Pediatrics, University of Colorado School of Medicine and Children's Hospital Colorado, Denver, CO.
³ Division of Critical Care, Department of Pediatrics, Indiana University School of Medicine, Indianapolis, IN.
⁴ Division of Infectious Diseases, Department of Pediatrics, Nationwide Children's Hospital and The Ohio State University, Columbus, OH.
⁵ Center for Precision Medicine, Nicklaus Children's Hospital, Miami, FL.
⁶ Nemours Cardiac Center, Alfred I. duPont Hospital for Children, Wilmington, DE.
⁷ Division of Infectious Diseases, Department of Medicine, University of Missouri-Columbia, Columbia, MO.
⁸ Center for Child Health, Behavior and Development, Seattle Children's Research Institute, Seattle, WA.
⁹ RECOVER Patient, Caregiver, or Community Representative New York, NY, USA.
¹⁰ Division of Pediatric Infectious Diseases, Stanford School of Medicine, Palo Alto, CA.
¹¹ Center for Genetic Medicine Research, Children's National Hospital, Washington, DC.
¹² Division of Pulmonary and Sleep Medicine, Department of Pediatrics, Seattle Children's Hospital, University of Washington, Seattle, WA.
¹³ Division of Pulmonary Medicine, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati OH.
¹⁴ Division of Infectious Diseases, Ann & Robert H. Lurie Children's Hospital of Chicago, Chicago, IL.

Abstract

Objective: Long COVID, marked by persistent, recurring, or new symptoms post-COVID-19 infection, impacts children's well-being yet lacks a unified clinical definition. This study evaluates the performance of an empirically derived Long COVID case identification algorithm, or computable phenotype, with manual chart review in a pediatric sample. This approach aims to facilitate large-scale research efforts to understand this condition better.

Methods: The algorithm, composed of diagnostic codes empirically associated with Long COVID, was applied to a cohort of pediatric patients with SARS-CoV-2 infection in the RECOVER PCORnet EHR database. The algorithm classified 31,781 patients with conclusive, probable, or possible Long COVID and 307,686 patients without evidence of Long COVID. A chart review was performed on a subset of patients (n=651) to determine the overlap between the two methods. Instances of discordance were reviewed to understand the reasons for differences.

Results: The sample comprised 651 pediatric patients (339 females, M _age = 10.10 years) across 16 hospital systems. Results showed moderate overlap between phenotype and chart review Long COVID identification (accuracy = 0.62, PPV = 0.49, NPV = 0.75); however, there were also numerous cases of disagreement. No notable differences were found when the analyses were stratified by age at infection or era of infection. Further examination of the discordant cases revealed that the most common cause of disagreement was the clinician reviewers' tendency to attribute Long COVID-like symptoms to prior medical conditions. The performance of the phenotype improved when prior medical conditions were considered (accuracy = 0.71, PPV = 0.65, NPV = 0.74).

Conclusions: Although there was moderate overlap between the two methods, the discrepancies between the two sources are likely attributed to the lack of consensus on a Long COVID clinical definition. It is essential to consider the strengths and limitations of each method when developing Long COVID classification algorithms.

Keywords: Chart review; Chronic COVID-19 Syndrome; Electronic health records; Electronic phenotyping; Late sequelae of COVID-19; Long COVID; Long haul COVID; Long-term COVID-19; PEDSnet; Post COVID syndrome; Post-acute COVID-19; Post-acute sequelae SARS-CoV-2 infection; Rule-based phenotyping.

Publication types

Preprint

Grants and funding

OT2 HL161847/HL/NHLBI NIH HHS/United States