Machine Learning-Based Activity Pattern Classification Using Personal PM2.5 Exposure Information

Int J Environ Res Public Health. 2020 Sep 9;17(18):6573. doi: 10.3390/ijerph17186573.

Abstract

The activity pattern is a significant factor in identifying hotspots of personal exposure to air pollutants, such as PM2.5. However, the recording process of an activity pattern can be annoying to study participants, because they are often asked to bring a diary or a tracking recorder to write or validate their activity patterns when they change their activity profiles. Furthermore, the accuracy of the records of activity patterns can be lower, because people can mistakenly record them. Thus, this paper proposes an idea to overcome these problems and make the whole data-collection process easier and more reliable. Our idea was based on transforming training data using the statistical properties of the children's personal exposure level to PM2.5, temperature, and relative humidity and applying the properties to a decision tree algorithm for classification of activity patterns. From our final machine-learning modeling processes, we observed that the accuracy for activity-pattern classification was more than 90% in both the training and test data. We believe that our methodology can be used effectively in data-collection tasks and alleviate the annoyance that study participants may feel.

Keywords: PM2.5; activity-pattern analysis; environmental data; machine learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Air Pollutants* / analysis
  • Child
  • Environmental Exposure / analysis
  • Environmental Monitoring
  • Humans
  • Machine Learning*
  • Particulate Matter* / analysis
  • Temperature

Substances

  • Air Pollutants
  • Particulate Matter