Background: The role of artificial intelligence (AI) in the discrimination between pulmonary cryptococcosis (PC) and lung adenocarcinoma (LA) warrants further research.
Objectives: To compare the performances of AI models with clinicians in distinguishing PC from LA on chest CT.
Methods: Patients diagnosed with confirmed PC or LA were retrospectively recruited from three tertiary hospitals in Guangzhou. A deep learning framework was employed to develop two models: an undelineated supervised training (UST) model utilising original CT images, and a delineated supervised training (DST) model utilising CT images with manual lesion annotations provided by physicians. A subset of 20 cases was randomly selected from the entire dataset and reviewed by clinicians through a network questionnaire. The sensitivity, specificity and accuracy of the models and the clinicians were calculated.
Results: A total of 395 PC cases and 249 LA cases were included in the final analysis. The internal validation results for the UST model showed a sensitivity of 85.3%, specificity of 81.0%, accuracy of 83.6% and an area under the curve (AUC) of 0.93. Similarly, the DST model exhibited a sensitivity of 88.2%, specificity of 88.1%, accuracy of 88.2% and an AUC of 0.94. The external validation of the two models yielded AUC values of 0.74 and 0.77, respectively. The average sensitivity, specificity and accuracy of 102 clinicians were determined to be 63.1%, 53.7% and 59.3%, respectively.
Conclusions: Both models outperformed the clinicians in distinguishing between PC and LA on chest CT, with the UST model exhibiting comparable performance to the DST model.
Keywords: artificial intelligence; computed tomography; deep learning; lung adenocarcinoma; pulmonary cryptococcosis; supervised training.
© 2024 Wiley-VCH GmbH. Published by John Wiley & Sons Ltd.