Multivariate testing and effect size measures for batch effect evaluation in radiomic features

Sci Rep. 2024 Jun 17;14(1):13923. doi: 10.1038/s41598-024-64208-z.

Abstract

While precision medicine applications of radiomics analysis are promising, differences in image acquisition can cause "batch effects" that reduce reproducibility and affect downstream predictive analyses. Harmonization methods such as ComBat have been developed to correct these effects, but evaluation methods for quantifying batch effects are inconsistent. In this study, we propose the use of the multivariate statistical test PERMANOVA and the Robust Effect Size Index (RESI) to better quantify and characterize batch effects in radiomics data. We evaluate these methods in both simulated and real radiomics features extracted from full-field digital mammography (FFDM) data. PERMANOVA demonstrated higher power than standard univariate statistical testing, and RESI was able to interpretably quantify the effect size of site at extremely large sample sizes. These methods show promise as more powerful and interpretable methods for the detection and quantification of batch effects in radiomics studies.

MeSH terms

  • Breast Neoplasms / diagnostic imaging
  • Female
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Mammography* / methods
  • Multivariate Analysis
  • Radiomics
  • Reproducibility of Results