The performance of deep learning on thyroid nodule imaging predicts thyroid cancer: A systematic review and meta-analysis of epidemiological studies with independent external test sets

Jin Xu; He-Li Xu; Yi-Ning Cao; Ying Huang; Song Gao; Qi-Jun Wu; Ting-Ting Gong

doi:10.1016/j.dsx.2023.102891

Diabetes Metab Syndr. 2023 Nov;17(11):102891. doi: 10.1016/j.dsx.2023.102891. Epub 2023 Oct 25.

Authors

Jin Xu¹, He-Li Xu², Yi-Ning Cao³, Ying Huang⁴, Song Gao¹, Qi-Jun Wu⁵, Ting-Ting Gong⁶

Affiliations

¹ Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China.
² Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang, China.
³ Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China; Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang, China.
⁴ Department of Ultrasound, Shengjing Hospital of China Medical University, Shenyang, China.
⁵ Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China; Department of Clinical Epidemiology, Shengjing Hospital of China Medical University, Shenyang, China; Key Laboratory of Reproductive and Genetic Medicine (China Medical University), National Health Commission, Shenyang, China. Electronic address: wuqj@sj-hospital.org.
⁶ Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China. Electronic address: gongtt@sj-hospital.org.

PMID: 37907027
DOI: 10.1016/j.dsx.2023.102891

Abstract

Background and aims: Based on recently available evidence, it remains controversial whether deep learning (DL) systems improve the accuracy of thyroid nodule classification on imaging. We conducted this study to analyze the current evidence on DL for image-based thyroid nodule diagnosis in both internal and external test sets.

Methods: We searched PubMed, IEEE, Embase, Web of Science, and the Cochrane Library through the end of December 2022. We included primary epidemiological studies using externally validated DL techniques in image-based thyroid nodule appraisal. This systematic review was registered on PROSPERO (CRD42022362892).

Results: We evaluated evidence from 17 primary epidemiological studies using externally validated DL techniques in image-based thyroid nodule appraisal. Fourteen studies were deemed eligible for meta-analysis. Overall, the pooled sensitivity, specificity, and area under the curve (AUC) of these DL algorithms were 0.89 (95% confidence interval 0.87-0.90), 0.84 (0.82-0.86), and 0.93 (0.91-0.95), respectively. In the internal validation sets, the pooled sensitivity, specificity, and AUC were 0.91 (0.89-0.93), 0.88 (0.85-0.91), and 0.96 (0.93-0.97), respectively. In the external validation sets, the pooled sensitivity, specificity, and AUC were 0.87 (0.85-0.89), 0.81 (0.77-0.83), and 0.91 (0.88-0.93), respectively. Notably, DL algorithms still demonstrated exceptional diagnostic validity in subgroup analyses.
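For readers unfamiliar with how pooled sensitivity and specificity are formed, the following is a minimal illustrative sketch, not the authors' method. The abstract does not state which pooling model was used, so the sketch assumes a simple fixed-effect, inverse-variance average on the logit scale. The function names and the per-study counts are hypothetical and are not taken from the included studies.

```python
import math


def sensitivity_specificity(tp, fp, fn, tn):
    """Return (sensitivity, specificity) from 2x2 confusion counts."""
    return tp / (tp + fn), tn / (tn + fp)


def _inv_logit(x):
    """Map a logit back to a proportion in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def pooled_logit_proportion(events, totals):
    """Fixed-effect inverse-variance pooling of proportions on the logit scale.

    Assumption: a 0.5 continuity correction is applied to every study so the
    logit stays finite when events == 0 or events == total.
    Returns (estimate, lower_95, upper_95).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for e, n in zip(events, totals):
        e_c = e + 0.5              # corrected event count
        f_c = (n - e) + 0.5        # corrected non-event count
        logit = math.log(e_c / f_c)
        variance = 1.0 / e_c + 1.0 / f_c
        weight = 1.0 / variance
        weighted_sum += weight * logit
        weight_total += weight
    mean = weighted_sum / weight_total
    se = math.sqrt(1.0 / weight_total)
    return _inv_logit(mean), _inv_logit(mean - 1.96 * se), _inv_logit(mean + 1.96 * se)


# Hypothetical per-study counts, for illustration only.
studies = [
    {"tp": 90, "fp": 20, "fn": 10, "tn": 80},
    {"tp": 170, "fp": 30, "fn": 30, "tn": 170},
    {"tp": 45, "fp": 12, "fn": 5, "tn": 38},
]

# Sensitivity pools true positives over all malignant nodules (tp + fn).
pooled_sens = pooled_logit_proportion(
    [s["tp"] for s in studies], [s["tp"] + s["fn"] for s in studies]
)
# Specificity pools true negatives over all benign nodules (tn + fp).
pooled_spec = pooled_logit_proportion(
    [s["tn"] for s in studies], [s["tn"] + s["fp"] for s in studies]
)
```

Diagnostic accuracy meta-analyses commonly use bivariate random-effects models that account for between-study heterogeneity and for the correlation between sensitivity and specificity; the fixed-effect version above only shows the basic weighting idea.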

Conclusions: Current evidence suggests that DL-based imaging analysis shows diagnostic performance comparable to that of clinicians for differentiating thyroid nodules in both internal and external test sets.

Keywords: Deep learning; External validation; Imaging diagnosis; Meta-analysis; Thyroid nodule.

Publication types

Meta-Analysis
Systematic Review

MeSH terms

Deep Learning*
Diagnosis, Differential
Epidemiologic Studies
Humans
Sensitivity and Specificity
Thyroid Neoplasms* / diagnostic imaging
Thyroid Neoplasms* / epidemiology
Thyroid Nodule* / diagnostic imaging
Thyroid Nodule* / epidemiology