Modeling Cases and Deaths per Million using Daily-Aggregated Facebook COVID-19 Symptom Survey Data

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov:2021:1627-1630. doi: 10.1109/EMBC46164.2021.9630870.

Abstract

We develop a novel analytic approach to modeling future COVID-19 risk using COVID-19 Symptom Survey data aggregated daily by US state, joined with daily time-series data on confirmed cases and deaths. Specifically, we model N-day forward-looking estimates for per-US-state-per-day change in deaths per million (DPM) and cases per million (CPM) using a multivariate regression model to below baseline error (65% and 38% mean absolute percentage error for DPM/CPM, respectively). Additionally, we model future changes in the curvature of CPM/DPM as "increasing" or "decreasing" using a random forest classifier to above 72% accuracy. In sum, we develop and characterize models to establish a relationship between behaviors and beliefs of individuals captured via the Facebook COVID-19 Symptom Surveys and the trajectory of COVID-19 outbreaks evidenced in terms of CPM and DPM. Such information can be helpful in assessing collective risks of infection and death during a pandemic as well as in determining the effectiveness of appropriate risk mitigation strategies based on behaviors evidenced through survey responses.

MeSH terms

  • COVID-19*
  • Humans
  • SARS-CoV-2
  • Social Media*