A machine learning model for orthodontic extraction/non-extraction decision in a racially and ethnically diverse patient population

Taylor Mason; Kynnedy M Kelly; George Eckert; Jeffrey A Dean; M Murat Dundar; Hakan Turkkahraman

doi:10.1016/j.ortho.2023.100759

Int Orthod. 2023 Sep;21(3):100759. doi: 10.1016/j.ortho.2023.100759. Epub 2023 May 15.

Authors

Taylor Mason¹, Kynnedy M Kelly², George Eckert³, Jeffrey A Dean⁴, M Murat Dundar⁵, Hakan Turkkahraman⁶

Affiliations

¹ Department of Orthodontics and Oral Facial Genetics, Indiana University School of Dentistry, Indianapolis, IN, US.
² Indiana University School of Dentistry, Indianapolis, IN, US.
³ Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN, US.
⁴ Department of Pediatric Dentistry, Indiana University School of Dentistry, Indianapolis, IN, US.
⁵ Department of Computer & Information Science, Indiana University Purdue University at Indianapolis, School of Science, Indianapolis, IN, US.
⁶ Department of Orthodontics and Oral Facial Genetics, Indiana University School of Dentistry, Indianapolis, IN, US. Electronic address: haturk@iu.edu.

PMID: 37196482
DOI: 10.1016/j.ortho.2023.100759

Abstract

Introduction: The purpose of the present study was to create a machine learning (ML) algorithm with the ability to predict the extraction/non-extraction decision in a racially and ethnically diverse sample.

Methods: Data were gathered from the records of 393 patients (200 non-extraction and 193 extraction) from a racially and ethnically diverse population. Four ML models (logistic regression [LR], random forest [RF], support vector machine [SVM], and neural network [NN]) were trained on a training set (70% of samples) and then tested on the remaining samples (30%). The accuracy and precision of the ML model predictions were assessed using the area under the curve (AUC) of the receiver operating characteristic (ROC) curve. The proportion of correct extraction/non-extraction decisions was also calculated.
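As an illustration of the evaluation described in the Methods, the following is a minimal sketch of a 70/30 split, the ROC AUC, and the proportion of correct decisions. It is not the authors' code: the abstract does not state the software, the random seed, or the decision threshold, so the function names, the 0.5 threshold, and the toy labels and scores below are all assumptions made for illustration only.

```python
import random


def split_train_test(records, train_fraction=0.7, seed=0):
    """Shuffle a copy of the records and split it into train and test lists."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)  # seed is an arbitrary assumption
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def roc_auc(labels, scores):
    """ROC AUC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as one half)."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one case of each class")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def proportion_correct(labels, scores, threshold=0.5):
    """Share of cases whose thresholded score matches the true label.
    The 0.5 threshold is an assumption, not taken from the study."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


# Hypothetical test-set output: 1 = extraction, 0 = non-extraction.
labels = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.7, 0.2]

auc = roc_auc(labels, scores)
accuracy = proportion_correct(labels, scores)

# Splitting 393 record indices 70/30, matching the sample size in the abstract.
train_ids, test_ids = split_train_test(range(393))

print(f"AUC = {auc:.4f}, proportion correct = {accuracy:.2f}")
print(f"train = {len(train_ids)}, test = {len(test_ids)}")
```

The AUC summarizes how well the scores rank extraction above non-extraction cases across all thresholds, whereas the proportion of correct decisions depends on the single threshold chosen, which is why the two figures can rank models differently.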

Results: The LR, SVM, and NN models performed best, with ROC AUCs of 91.0%, 92.5%, and 92.3%, respectively. The overall proportion of correct decisions was 82%, 76%, 83%, and 81% for the LR, RF, SVM, and NN models, respectively. The features found to be most helpful to the ML algorithms in making their decisions were maxillary crowding/spacing, L1-NB (mm), U1-NA (mm), PFH:AFH, and SN-MP (°), although many other features contributed significantly.

Conclusions: ML models can predict the extraction decision in a racially and ethnically diverse patient population with a high degree of accuracy and precision. Crowding, sagittal, and vertical characteristics all featured prominently in the hierarchy of components most influential to the ML decision-making process.

Keywords: Artificial intelligence; Clinical Decision-Making; Machine learning; Orthodontics; Tooth Extraction.

MeSH terms

Algorithms*
Area Under Curve
Humans
Logistic Models
Machine Learning*
Random Forest