Mining Public Opinions on COVID-19 Vaccination: A Temporal Analysis to Support Combating Misinformation

Trop Med Infect Dis. 2022 Sep 22;7(10):256. doi: 10.3390/tropicalmed7100256.

Abstract

This article presents a study that applied opinion analysis about COVID-19 immunization in Brazil. An initial set of 143,615 tweets was collected containing 49,477 pro- and 44,643 anti-vaccination and 49,495 neutral posts. Supervised classifiers (multinomial naïve Bayes, logistic regression, linear support vector machines, random forests, adaptative boosting, and multilayer perceptron) were tested, and multinomial naïve Bayes, which had the best trade-off between overfitting and correctness, was selected to classify a second set containing 221,884 unclassified tweets. A timeline with the classified tweets was constructed, helping to identify dates with peaks in each polarity and search for events that may have caused the peaks, providing methodological assistance in combating sources of misinformation linked to the spread of anti-vaccination opinion.

Keywords: Brazil; COVID-19; misinformation; opinion mining; pandemics; temporal analysis; twitter data; vaccination.

Grants and funding

This research received no external funding.