Classification of soybean chemical characteristics by excitation emission matrix coupled with t-SNE dimensionality reduction

Spectrochim Acta A Mol Biomol Spectrosc. 2024 Dec 5:322:124785. doi: 10.1016/j.saa.2024.124785. Epub 2024 Jul 4.

Abstract

Measuring the chemical composition in soybeans is time-consuming and laborious, and even simple near-infrared sensors generally require the creation of calibration curves before application. In this study, a new screening method for soybeans without calibration curves was investigated by combining the excitation emission matrix (EEM) and dimensionality reduction analysis. The EEMs of 34 soybean samples were measured, and representative chemical contents including crude protein, crude oil and isoflavone contents were measured by chemical analysis. Two methods of dimensionality reduction: principal component analysis (PCA) and t-distributed Stochastic Neighbor Embedding (t-SNE) were applied on the EEM data to obtain two-dimensional plots, which were divided into two regions with large or small amount of each chemical components. To classify the large or small levels of each of the chemical composition, machine learning classification models were constructed on the two-dimensional plots after dimensionality reduction. As a result, the classification accuracy was higher in t-SNE than in the combinations of PC1 and PC2 from PCA. Furthermore, in t-SNE, the classification accuracy reached over 90% for all the chemical components. From these results, t-SNE dimensionality reduction on the soybean EEM has the potential for easy and accurate screening of soybeans especially based on isoflavone contents.

Keywords: Excitation emission matrix (EEM); Isoflavone; Screening; Soybean; t-SNE.

MeSH terms

  • Glycine max* / chemistry
  • Glycine max* / classification
  • Isoflavones / analysis
  • Isoflavones / chemistry
  • Machine Learning
  • Principal Component Analysis*
  • Soybean Proteins / analysis
  • Soybean Proteins / chemistry
  • Soybean Proteins / classification

Substances

  • Isoflavones
  • Soybean Proteins