Machine learning-based prediction of pulmonary embolism to reduce unnecessary computed tomography scans in gastrointestinal cancer patients: a retrospective multicenter study

Joo Seong Kim; Doyun Kwon; Kyungdo Kim; Sang Hyub Lee; Seung-Bo Lee; Kwangsoo Kim; Dongmin Kim; Min Woo Lee; Namyoung Park; Jin Ho Choi; Eun Sun Jang; In Rae Cho; Woo Hyun Paik; Jun Kyu Lee; Ji Kon Ryu; Yong-Tae Kim

doi:10.1038/s41598-024-75977-y

Sci Rep. 2024 Oct 25;14(1):25359. doi: 10.1038/s41598-024-75977-y.

Authors

Joo Seong Kim^#¹,², Doyun Kwon^#³, Kyungdo Kim^#⁴,⁵, Sang Hyub Lee^#⁶, Seung-Bo Lee^#⁷, Kwangsoo Kim⁵,⁸, Dongmin Kim⁹, Min Woo Lee¹, Namyoung Park¹⁰, Jin Ho Choi¹, Eun Sun Jang¹¹, In Rae Cho¹, Woo Hyun Paik¹, Jun Kyu Lee², Ji Kon Ryu¹, Yong-Tae Kim¹

Affiliations

¹ Department of Internal Medicine and Liver Research Institute, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Korea.
² Department of Internal Medicine, Dongguk University College of Medicine, Dongguk University Ilsan Hospital, Goyang-si, Korea.
³ Interdisciplinary Program of Medical Informatics, Seoul National University College of Medicine, Seoul, Korea.
⁴ Department of Biomedical Engineering, Pratt School of Engineering, Duke University, Durham, NC, 27708, USA.
⁵ Transdisciplinary Department of Medicine & Advanced Technology, Seoul National University Hospital, Seoul, Korea.
⁶ Department of Internal Medicine and Liver Research Institute, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Korea. gidoctor@snu.ac.kr.
⁷ Department of Medical Informatics, Keimyung University School of Medicine, 1095, Dalgubeol-daero, Dalseo-gu, Daegu, 42601, Republic of Korea. koreateam23@gmail.com.
⁸ Department of Medicine, Seoul National University College of Medicine, Seoul, Korea.
⁹ Biomedical Research Institute, Seoul National University Hospital, Seoul, Korea.
¹⁰ Department of Medicine, Kyung Hee University Gangdong Hospital, Seoul, Korea.
¹¹ Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam-si, Korea.

^# Contributed equally.

Abstract

This study aimed to develop a machine learning (ML) model for predicting pulmonary embolism (PE) in patients with gastrointestinal cancers, a group at increased risk for PE. We conducted a retrospective, multicenter study of patients who underwent computed tomographic pulmonary angiography (CTPA) between 2010 and 2020. Demographic and clinical data, including the Wells score and D-dimer levels, were used to train a random forest ML model. Model performance was assessed using the area under the receiver operating characteristic curve (AUROC). In total, 446 patients from hospital A and 139 from hospital B were included. The training set consisted of 356 patients from hospital A; internal validation used the remaining 90 patients from hospital A, and external validation used the 139 patients from hospital B. The model achieved an AUROC of 0.736 in hospital A and 0.669 in hospital B. Compared with the conventional diagnostic strategy, the ML model significantly reduced the proportion of patients recommended for CTPA (conventional vs. ML, hospital A: 100.0% vs. 91.1%, P < 0.001; hospital B: 100.0% vs. 93.5%, P = 0.003). The results indicate that an ML-based prediction model can reduce unnecessary CTPA procedures in gastrointestinal cancer patients, highlighting its potential to enhance diagnostic efficiency and reduce patient burden.
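
The two quantities reported in the abstract, the AUROC and the proportion of patients recommended for CTPA, can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `auroc` and `fraction_referred`, the toy labels and scores, and the 0.2 referral threshold are all hypothetical and chosen only to show how such metrics are typically computed from a model's predicted probabilities.

```python
# Illustrative sketch only; the data and threshold below are made up,
# not taken from the study.

def auroc(labels, scores):
    """AUROC as the probability that a randomly chosen PE-positive case
    receives a higher score than a randomly chosen PE-negative case
    (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def fraction_referred(scores, threshold):
    """Share of patients whose predicted risk meets the (hypothetical)
    threshold and who would therefore still be recommended for CTPA."""
    return sum(s >= threshold for s in scores) / len(scores)


# Hypothetical toy cohort: 1 = PE confirmed on CTPA, 0 = no PE.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 0]
toy_scores = [0.9, 0.6, 0.3, 0.7, 0.4, 0.2, 0.2, 0.1]

toy_auroc = auroc(toy_labels, toy_scores)
toy_referred = fraction_referred(toy_scores, threshold=0.2)
print(f"AUROC: {toy_auroc:.3f}, referred for CTPA: {toy_referred:.1%}")
```

In this framing, the conventional strategy corresponds to referring every patient (100.0%), and any threshold that excludes low-risk patients lowers the referred fraction below that baseline.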

Keywords: Computed tomographic pulmonary angiography; Gastrointestinal cancer; Machine learning; Pulmonary embolism; Random forest model.

Publication types

Multicenter Study

MeSH terms

Aged
Computed Tomography Angiography / methods
Female
Gastrointestinal Neoplasms* / diagnostic imaging
Humans
Machine Learning*
Male
Middle Aged
Pulmonary Embolism* / diagnostic imaging
ROC Curve
Retrospective Studies
Tomography, X-Ray Computed / methods
Unnecessary Procedures / statistics & numerical data

Grants and funding

0420212090/Seoul National University Hospital