Background: Gestational Diabetes Mellitus (GDM) is a common complication during pregnancy. Late diagnosis can have significant implications for both the mother and the fetus. This research aims to create an early prediction model for GDM in the first trimester of pregnancy. This model will help obstetricians and gynecologists make appropriate decisions for treating and preventing GDM complications.
Methods: This applied descriptive study was conducted at the fertility health center of Vali-e-Asr Hospital in Tehran, Iran. The dataset was collected from the records of pregnant women registered in the hospital's system from 2020 to 2022. Risk factors for designing predictive models were identified through literature review, expert opinions, and clinical specialists' input. The extracted information underwent preprocessing, and six machine learning (ML) methods were developed and evaluated for GDM prediction in the first trimester of pregnancy: decision tree (DT), multilayer perceptron (MLP), k-nearest neighbors (KNN), Naïve Bayes (NB), random forest (RF), and extreme gradient boosting (XGBoost).
Results: Models were evaluated using accuracy, precision, sensitivity, and the area under the receiver operating characteristic curve (AUC). Based on the glucose tolerance test (GTT) results, the RF model achieved the best performance in GDM prediction, with 89% accuracy, 86% precision, 92% recall, and 94% AUC, using demographic variables, medical history, and clinical findings. In modeling based on insulin consumption, the RF model achieved the best results with 62% accuracy, 60% precision, 63% recall, and 63% AUC in predicting GDM using demographic variables and medical history.
Conclusion: The results of this study demonstrate that ML algorithms, especially RF, have acceptable accuracy in the early prediction of GDM during the first trimester of pregnancy.
Keywords: Artificial intelligence; First trimester of pregnancy; Gestational diabetes mellitus; Machine learning; Prediction; Random forest.
© 2024. The Author(s).