Automatic Delineation of Gross Tumor Volume Based on Magnetic Resonance Imaging by Performing a Novel Semisupervised Learning Framework in Nasopharyngeal Carcinoma

Int J Radiat Oncol Biol Phys. 2022 Jul 15;113(4):893-902. doi: 10.1016/j.ijrobp.2022.03.031. Epub 2022 Apr 3.

Abstract

Purpose: We aimed to validate the accuracy and clinical value of a novel semisupervised learning framework for gross tumor volume (GTV) delineation in nasopharyngeal carcinoma.

Methods and materials: Two hundred fifty-eight patients with magnetic resonance imaging data sets were divided into training (n = 180), validation (n = 20), and testing (n = 58) cohorts. Ground truth contours of nasopharynx GTV (GTVnx) and node GTV (GTVnd) were manually delineated by 2 experienced radiation oncologists. Twenty percent (n = 36) labeled and 80% (n = 144) unlabeled images were used to train the model, producing model-generated contours for patients from the testing cohort. Nine experienced experts were invited to revise model-generated GTV in 20 randomly selected patients from the testing cohort. Six junior oncologists were asked to delineate GTV in 12 randomly selected patients from the testing cohort without and with the assistance of the model, and revision degrees were compared under these 2 modes. The Dice similarity coefficient (DSC) was used to quantify the accuracy of the model.

Results: The model-generated contours showed a high accuracy compared with ground truth contours, with an average DSC score of 0.83 and 0.80 for GTVnx and GTVnd, respectively. There was no significant difference in DSC score between T1-2 and T3-4 patients (0.81 vs 0.83; P = .223), or between N1-2 and N3 patients (0.80 vs 0.79; P = .807). The mean revision degree was lower than 10% in 19 (95%) patients for GTVnx and in 16 (80%) patients for GTVnd. With assistance of the model, the mean revision degree for GTVnx and GTVnd by junior oncologists was reduced from 25.63% to 7.75% and from 21.38% to 14.44%, respectively. Meanwhile, the delineating efficiency was improved by over 60%.

Conclusions: The proposed semisupervised learning-based model showed a high accuracy for delineating GTV of nasopharyngeal carcinoma. It was clinically applicable and could assist junior oncologists to improve GTV contouring accuracy and save contouring time.

MeSH terms

  • Humans
  • Magnetic Resonance Imaging* / methods
  • Nasopharyngeal Carcinoma / diagnostic imaging
  • Nasopharyngeal Neoplasms* / diagnostic imaging
  • Nasopharyngeal Neoplasms* / radiotherapy
  • Nasopharynx
  • Tumor Burden