A Prospective Study on Risk Prediction of Preeclampsia Using Bi-Platform Calibration and Machine Learning

Int J Mol Sci. 2024 Oct 4;25(19):10684. doi: 10.3390/ijms251910684.

Abstract

Preeclampsia is a pregnancy syndrome characterized by complex symptoms which cause maternal and fetal problems and deaths. The aim of this study is to achieve preeclampsia risk prediction and early risk prediction in Xinjiang, China, based on the placental growth factor measured using the SiMoA or Elecsys platform. A novel reliable calibration modeling method and missing data imputing method are proposed, in which different strategies are used to adapt to small samples, training data, test data, independent features, and dependent feature pairs. Multiple machine learning algorithms were applied to train models using various datasets, such as single-platform versus bi-platform data, early pregnancy versus early plus non-early pregnancy data, and real versus real plus augmented data. It was found that a combination of two types of mono-platform data could improve risk prediction performance, and non-early pregnancy data could enhance early risk prediction performance when limited early pregnancy data were available. Additionally, the inclusion of augmented data resulted in achieving a high but unstable performance. The models in this study significantly reduced the incidence of preeclampsia in the region from 7.2% to 2.0%, and the mortality rate was reduced to 0%.

Keywords: bi-platform calibration; data imbalance problem; multilayer perceptron; preeclampsia risk prediction; random forest algorithm.

MeSH terms

  • Adult
  • Algorithms
  • Calibration
  • China / epidemiology
  • Female
  • Humans
  • Machine Learning*
  • Placenta Growth Factor / blood
  • Placenta Growth Factor / metabolism
  • Pre-Eclampsia* / diagnosis
  • Pregnancy
  • Prospective Studies
  • Risk Assessment / methods
  • Risk Factors

Substances

  • Placenta Growth Factor