Prediction of carcinogenic human papillomavirus types in cervical cancer from multiparametric magnetic resonance images with machine learning-based radiomics models

Diagn Interv Radiol. 2023 May 31;29(3):460-468. doi: 10.4274/dir.2022.221335. Epub 2022 Dec 21.

Abstract

Purpose: This study aimed to evaluate the potential of machine learning-based models for predicting carcinogenic human papillomavirus (HPV) oncogene types using radiomics features from magnetic resonance imaging (MRI).

Methods: Pre-treatment MRI images of patients with cervical cancer were collected retrospectively. An HPV DNA oncogene analysis was performed based on cervical biopsy specimens. Radiomics features were extracted from contrast-enhanced T1-weighted images (CE-T1) and T2-weighted images (T2WI). A third feature subset was created as a combined group by concatenating the CE-T1 and T2WI subsets. Feature selection was performed using Pearson's correlation coefficient and wrapper- based sequential-feature selection. Two models were built with each feature subset, using support vector machine (SVM) and logistic regression (LR) classifiers. The models were validated using a five-fold cross-validation technique and compared using Wilcoxon's signed rank and Friedman's tests.

Results: Forty-one patients were enrolled in the study (26 were positive for carcinogenic HPV oncogenes, and 15 were negative). A total of 851 features were extracted from each imaging sequence. After feature selection, 5, 17, and 20 features remained in the CE-T1, T2WI, and combined groups, respectively. The SVM models showed 83%, 95%, and 95% accuracy scores, and the LR models revealed 83%, 81%, and 92.5% accuracy scores in the CE-T1, T2WI, and combined groups, respectively. The SVM algorithm performed better than the LR algorithm in the T2WI feature subset (P = 0.005), and the feature sets in the T2WI and the combined group performed better than CE-T1 in the SVM model (P = 0.033 and 0.006, respectively). The combined group feature subset performed better than T2WI in the LR model (P = 0.023).

Conclusion: Machine learning-based radiomics models based on pre-treatment MRI can detect carcinogenic HPV status with discriminative accuracy.

Keywords: Artificial intelligence; human papillomavirus DNA tests; machine learning; radiology; uterine cervical neoplasms.

MeSH terms

  • Carcinogens
  • Female
  • Human Papillomavirus Viruses
  • Humans
  • Machine Learning
  • Magnetic Resonance Imaging / methods
  • Papillomavirus Infections* / diagnostic imaging
  • Retrospective Studies
  • Uterine Cervical Neoplasms* / diagnostic imaging
  • Uterine Cervical Neoplasms* / pathology

Substances

  • Carcinogens