Sources of variation in multicenter rectal MRI data and their effect on radiomics feature reproducibility

Niels W Schurink; Simon R van Kranen; Sander Roberti; Joost J M van Griethuysen; Nino Bogveradze; Francesca Castagnoli; Najim El Khababi; Frans C H Bakers; Shira H de Bie; Gerlof P T Bosma; Vincent C Cappendijk; Remy W F Geenen; Peter A Neijenhuis; Gerald M Peterson; Cornelis J Veeken; Roy F A Vliegen; Regina G H Beets-Tan; Doenja M J Lambregts

doi:10.1007/s00330-021-08251-8

Sources of variation in multicenter rectal MRI data and their effect on radiomics feature reproducibility

Eur Radiol. 2022 Mar;32(3):1506-1516. doi: 10.1007/s00330-021-08251-8. Epub 2021 Oct 16.

Authors

Niels W Schurink^{1

2}, Simon R van Kranen³, Sander Roberti⁴, Joost J M van Griethuysen^{1

2}, Nino Bogveradze^{1

2

5}, Francesca Castagnoli¹, Najim El Khababi^{1

2}, Frans C H Bakers⁶, Shira H de Bie⁷, Gerlof P T Bosma⁸, Vincent C Cappendijk⁹, Remy W F Geenen¹⁰, Peter A Neijenhuis¹¹, Gerald M Peterson¹², Cornelis J Veeken¹³, Roy F A Vliegen¹⁴, Regina G H Beets-Tan^{15

16}, Doenja M J Lambregts¹⁷

Affiliations

¹ Department of Radiology, The Netherlands Cancer Institute, POB 90203, 1006 BE, Amsterdam, The Netherlands.
² GROW School for Oncology & Developmental Biology, University of Maastricht, Maastricht, The Netherlands.
³ Department of Radiation Oncology, The Netherlands Cancer Institute, Amsterdam, The Netherlands.
⁴ Department of Epidemiology and Biostatistics, The Netherlands Cancer Institute, Amsterdam, The Netherlands.
⁵ Department of Radiology, Acad. F. Todua Medical Center, Research Institute of Clinical Medicine, Tbilisi, Georgia.
⁶ Department of Radiology, Maastricht University Medical Centre, Maastricht, The Netherlands.
⁷ Department of Radiology, Deventer Ziekenhuis, Deventer, The Netherlands.
⁸ Department of Interventional Radiology, Elisabeth Tweesteden Hospital, Tilburg, The Netherlands.
⁹ Department of Radiology, Jeroen Bosch Hospital, 's-Hertogenbosch, The Netherlands.
¹⁰ Department of Radiology, Northwest Clinics, Alkmaar, The Netherlands.
¹¹ Department of Surgery, Alrijne Hospital, Leiderdorp, The Netherlands.
¹² Department of Radiology, Spaarne Gasthuis, Haarlem, The Netherlands.
¹³ Department of Radiology, IJsselland Hospital, Capelle Aan Den IJssel, The Netherlands.
¹⁴ Department of Radiology, Zuyderland Medical Center, Heerlen, The Netherlands.
¹⁵ Department of Radiology, The Netherlands Cancer Institute, POB 90203, 1006 BE, Amsterdam, The Netherlands. r.beetstan@nki.nl.
¹⁶ GROW School for Oncology & Developmental Biology, University of Maastricht, Maastricht, The Netherlands. r.beetstan@nki.nl.
¹⁷ Department of Radiology, The Netherlands Cancer Institute, POB 90203, 1006 BE, Amsterdam, The Netherlands. d.lambregts@nki.nl.

Abstract

Objectives: To investigate sources of variation in a multicenter rectal cancer MRI dataset focusing on hardware and image acquisition, segmentation methodology, and radiomics feature extraction software.

Methods: T2W and DWI/ADC MRIs from 649 rectal cancer patients were retrospectively acquired in 9 centers. Fifty-two imaging features (14 first-order/6 shape/32 higher-order) were extracted from each scan using whole-volume (expert/non-expert) and single-slice segmentations using two different software packages (PyRadiomics/CapTk). Influence of hardware, acquisition, and patient-intrinsic factors (age/gender/cTN-stage) on ADC was assessed using linear regression. Feature reproducibility was assessed between segmentation methods and software packages using the intraclass correlation coefficient.

Results: Image features differed significantly (p < 0.001) between centers with more substantial variations in ADC compared to T2W-MRI. In total, 64.3% of the variation in mean ADC was explained by differences in hardware and acquisition, compared to 0.4% by patient-intrinsic factors. Feature reproducibility between expert and non-expert segmentations was good to excellent (median ICC 0.89-0.90). Reproducibility for single-slice versus whole-volume segmentations was substantially poorer (median ICC 0.40-0.58). Between software packages, reproducibility was good to excellent (median ICC 0.99) for most features (first-order/shape/GLCM/GLRLM) but poor for higher-order (GLSZM/NGTDM) features (median ICC 0.00-0.41).

Conclusions: Significant variations are present in multicenter MRI data, particularly related to differences in hardware and acquisition, which will likely negatively influence subsequent analysis if not corrected for. Segmentation variations had a minor impact when using whole volume segmentations. Between software packages, higher-order features were less reproducible and caution is warranted when implementing these in prediction models.

Key points: • Features derived from T2W-MRI and in particular ADC differ significantly between centers when performing multicenter data analysis. • Variations in ADC are mainly (> 60%) caused by hardware and image acquisition differences and less so (< 1%) by patient- or tumor-intrinsic variations. • Features derived using different image segmentations (expert/non-expert) were reproducible, provided that whole-volume segmentations were used. When using different feature extraction software packages with similar settings, higher-order features were less reproducible.

Keywords: Image processing, Computer-assisted; Magnetic resonance imaging; Multicenter study; Rectal neoplasms; Reproducibility of results.

Publication types

Multicenter Study

MeSH terms

Diffusion Magnetic Resonance Imaging
Humans
Image Processing, Computer-Assisted
Magnetic Resonance Imaging*
Rectal Neoplasms* / diagnostic imaging
Reproducibility of Results
Retrospective Studies

Grants and funding

10138/kwf kankerbestrijding