Shedding light on dark figures: Steps towards a methodology for estimating actual numbers of COVID-19 infections in Germany based on Google Trends

PLoS One. 2022 Oct 26;17(10):e0276485. doi: 10.1371/journal.pone.0276485. eCollection 2022.

Abstract

In order to shed light on unmeasurable real-world phenomena, we investigate exemplarily the actual number of COVID-19 infections in Germany based on big data. The true occurrence of infections is not visible, since not every infected person is tested. This paper demonstrates that coronavirus-related search queries issued on Google can depict true infection levels appropriately. We find significant correlation between search volume and national as well as federal COVID-19 cases as reported by RKI. Additionally, we discover indications that the queries are indeed causal for infection levels. Finally, this approach can replicate varying dark figures throughout different periods of the pandemic and enables early insights into the true spread of future virus outbreaks. This is of high relevance for society in order to assess and understand the current situation during virus outbreaks and for decision-makers to take adequate and justifiable health measures.

MeSH terms

  • COVID-19* / epidemiology
  • Disease Outbreaks
  • Germany / epidemiology
  • Humans
  • Pandemics
  • Search Engine

Grants and funding

The author received no specific funding for this work.