Breast US computer-aided diagnosis system: robustness across urban populations in South Korea and the United States

Nicholas P Gruszauskas; Karen Drukker; Maryellen L Giger; Ruey-Feng Chang; Charlene A Sennett; Woo Kyung Moon; Lorenzo L Pesce

doi:10.1148/radiol.2533090280

Radiology. 2009 Dec;253(3):661-71. doi: 10.1148/radiol.2533090280. Epub 2009 Oct 28.

Authors

Nicholas P Gruszauskas¹, Karen Drukker, Maryellen L Giger, Ruey-Feng Chang, Charlene A Sennett, Woo Kyung Moon, Lorenzo L Pesce

Affiliation

¹ Department of Radiology, University of Chicago, 5841 S Maryland Ave, MC 2026, Chicago, IL 60637, USA. ngrusz1@uchicago.edu

Abstract

Purpose: To evaluate the robustness of a breast ultrasonographic (US) computer-aided diagnosis (CAD) system in terms of its performance across different patient populations.

Materials and methods: Three US databases were analyzed for this study: one South Korean and two United States databases. All three databases were utilized in an institutional review board-approved and HIPAA-compliant manner. Round-robin analysis and independent testing were performed to evaluate the performance of a computerized breast cancer classification scheme across the databases. Receiver operating characteristic (ROC) analysis was used to evaluate performance differences.
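The two evaluation modes named above can be illustrated with a minimal sketch. This is not the authors' classification scheme: the nearest-centroid scorer, the two-feature synthetic databases, and every function name here (`auc_mann_whitney`, `fit_centroids`, `round_robin_auc`, `independent_auc`, `make_db`) are assumptions made only for illustration. Round-robin is shown as plain leave-one-out; the study may have grouped held-out items by lesion or patient. The area under the ROC curve is estimated nonparametrically as the fraction of malignant/benign pairs that are ranked correctly.

```python
import random

def auc_mann_whitney(scores, labels):
    """Nonparametric AUC: share of (malignant, benign) pairs ranked correctly; ties count half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def fit_centroids(X, y):
    """Stand-in classifier (an assumption): mean feature vector of each class."""
    def mean(rows):
        return [sum(col) / len(rows) for col in zip(*rows)]
    malignant = [x for x, label in zip(X, y) if label == 1]
    benign = [x for x, label in zip(X, y) if label == 0]
    return mean(malignant), mean(benign)

def score(model, x):
    """Higher score means x lies closer to the malignant centroid than to the benign one."""
    c_mal, c_ben = model
    d_mal = sum((a - b) ** 2 for a, b in zip(x, c_mal))
    d_ben = sum((a - b) ** 2 for a, b in zip(x, c_ben))
    return d_ben - d_mal

def round_robin_auc(X, y):
    """Leave-one-out: train on all cases but one, score the held-out case, repeat."""
    scores = []
    for i in range(len(X)):
        model = fit_centroids(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        scores.append(score(model, X[i]))
    return auc_mann_whitney(scores, y)

def independent_auc(X_train, y_train, X_test, y_test):
    """Independent testing: train on one database, test on a different one."""
    model = fit_centroids(X_train, y_train)
    return auc_mann_whitney([score(model, x) for x in X_test], y_test)

def make_db(n, shift, seed):
    """Hypothetical synthetic database; `shift` mimics a population difference in one feature."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2  # alternate benign (0) and malignant (1)
        X.append([rng.gauss(1.5 * label + shift, 1.0), rng.gauss(1.0 * label, 1.0)])
        y.append(label)
    return X, y

if __name__ == "__main__":
    Xa, ya = make_db(200, shift=0.0, seed=1)
    Xb, yb = make_db(200, shift=0.8, seed=2)
    print("round-robin AUC, database A:", round(round_robin_auc(Xa, ya), 3))
    print("independent AUC, train A / test B:", round(independent_auc(Xa, ya, Xb, yb), 3))
```

In a sketch like this, a gap between the round-robin and independent figures would reflect how well a model trained on one population transfers to another, which is the kind of comparison the study reports.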

Results: The round-robin analyses of each database demonstrated similar results, with areas under the ROC curve ranging from 0.88 (95% confidence interval [CI]: 0.820, 0.918) to 0.91 (95% CI: 0.86, 0.95). In independent testing, performances were also similar, but the range in areas under the ROC curve (from 0.79 [95% CI: 0.730, 0.842] to 0.87 [95% CI: 0.794, 0.923]) was wider than in the round-robin tests. Statistically significant differences in performance were demonstrated only when the Korean database served as the test set in independent testing.

Conclusion: The few observed statistically significant differences in performance indicated that while the US features used by the system were useful across the databases, their relative importance differed. In practice, this means that a CAD system may need to be adjusted when applied to a different population.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Bayes Theorem
Breast Neoplasms / diagnostic imaging*
Breast Neoplasms / epidemiology
Diagnosis, Computer-Assisted*
Female
Humans
ROC Curve
Republic of Korea / epidemiology
Statistics, Nonparametric
Ultrasonography, Mammary*
United States / epidemiology
Urban Population
