Using Association Rules to Obtain Sets of Prevalent Symptoms throughout the COVID-19 Pandemic: An Analysis of Similarities between Cases of COVID-19 and Unspecified SARS in São Paulo-Brazil

Int J Environ Res Public Health. 2024 Sep 1;21(9):1164. doi: 10.3390/ijerph21091164.

Abstract

The efficient recognition of symptoms in viral infections holds promise for swift and precise diagnosis, thus mitigating health implications and the potential recurrence of infections. COVID-19 presents unique challenges due to various factors influencing diagnosis, especially regarding disease symptoms that closely resemble those of other viral diseases, including other strains of SARS, thus impacting the identification of useful and meaningful symptom patterns as they emerge in infections. Therefore, this study proposes an association rule mining approach, utilising the Apriori algorithm to analyse the similarities between individuals with confirmed SARS-CoV-2 diagnosis and those with unspecified SARS diagnosis. The objective is to investigate, through symptom rules, the presence of COVID-19 patterns among individuals initially not diagnosed with the disease. Experiments were conducted using cases from Brazilian SARS datasets for São Paulo State. Initially, reporting percentage similarities of symptoms in both groups were analysed. Subsequently, the top ten rules from each group were compared. Finally, a search for the top five most frequently occurring positive rules among the unspecified ones, and vice versa, was conducted to identify identical rules, with a particular focus on the presence of positive rules among the rules of individuals initially diagnosed with unspecified SARS.

Keywords: COVID-19; apriori; association rule mining; symptom patterns; symptoms.

MeSH terms

  • Algorithms
  • Brazil / epidemiology
  • COVID-19* / epidemiology
  • Humans
  • Pandemics
  • Prevalence
  • SARS-CoV-2*
  • Severe Acute Respiratory Syndrome / epidemiology