Design and Experiment of a Portable Near-Infrared Spectroscopy Device for Convenient Prediction of Leaf Chlorophyll Content

Sensors (Basel). 2023 Oct 19;23(20):8585. doi: 10.3390/s23208585.

Abstract

This study designs a spectrum data collection device and system based on the Internet of Things technology, aiming to solve the tedious process of chlorophyll collection and provide a more convenient and accurate method for predicting chlorophyll content. The device has the advantages of integrated design, portability, ease of operation, low power consumption, low cost, and low maintenance requirements, making it suitable for outdoor spectrum data collection and analysis in fields such as agriculture, environment, and geology. The core processor of the device uses the ESP8266-12F microcontroller to collect spectrum data by communicating with the spectrum sensor. The spectrum sensor used is the AS7341 model, but its limited number of spectral acquisition channels and low resolution may limit the exploration and analysis of spectral data. To verify the performance of the device and system, this experiment collected spectral data of Hami melon leaf samples and combined it with a chlorophyll meter for related measurements and analysis. In the experiment, twelve regression algorithms were tested, including linear regression, decision tree, and support vector regression. The results showed that in the original spectral data, the ETR method had the best prediction effect at a wavelength of 515 nm. In the training set, RMSEc was 0.3429, and Rc2 was 0.9905. In the prediction set, RMSEp was 1.5670, and Rp2 was 0.8035. In addition, eight preprocessing methods were used to denoise the original data, but the improvement in prediction accuracy was not significant. To further improve the accuracy of data analysis, principal component analysis and isolation forest algorithm were used to detect and remove outliers in the spectral data. After removing the outliers, the RFR model performed best in predicting all wavelength combinations of denoised spectral data using PBOR. In the training set, RMSEc was 0.8721, and Rc2 was 0.9429. In the prediction set, RMSEp was 1.1810, and Rp2 was 0.8683.

Keywords: Internet of Things technology; chlorophyll content prediction; data preprocessing; regression algorithms; spectral data analysis.

MeSH terms

  • Agriculture
  • Chlorophyll* / analysis
  • Least-Squares Analysis
  • Plant Leaves / chemistry
  • Spectroscopy, Near-Infrared* / methods

Substances

  • Chlorophyll