Multi-scale input layers and dense decoder aggregation network for COVID-19 lesion segmentation from CT scans

Xiaoke Lan; Wenbing Jin

doi:10.1038/s41598-024-74701-0

Multi-scale input layers and dense decoder aggregation network for COVID-19 lesion segmentation from CT scans

Sci Rep. 2024 Oct 10;14(1):23729. doi: 10.1038/s41598-024-74701-0.

Authors

Xiaoke Lan¹, Wenbing Jin²

Affiliations

¹ College of Internet of Things Technology, Hangzhou Polytechnic, Hangzhou, 311402, China. lxk@mail.hzpt.edu.cn.
² College of Internet of Things Technology, Hangzhou Polytechnic, Hangzhou, 311402, China.

Abstract

Accurate segmentation of COVID-19 lesions from medical images is essential for achieving precise diagnosis and developing effective treatment strategies. Unfortunately, this task presents significant challenges, owing to the complex and diverse characteristics of opaque areas, subtle differences between infected and healthy tissue, and the presence of noise in CT images. To address these difficulties, this paper designs a new deep-learning architecture (named MD-Net) based on multi-scale input layers and dense decoder aggregation network for COVID-19 lesion segmentation. In our framework, the U-shaped structure serves as the cornerstone to facilitate complex hierarchical representations essential for accurate segmentation. Then, by introducing the multi-scale input layers (MIL), the network can effectively analyze both fine-grained details and contextual information in the original image. Furthermore, we introduce an SE-Conv module in the encoder network, which can enhance the ability to identify relevant information while simultaneously suppressing the transmission of extraneous or non-lesion information. Additionally, we design a dense decoder aggregation (DDA) module to integrate feature distributions and important COVID-19 lesion information from adjacent encoder layers. Finally, we conducted a comprehensive quantitative analysis and comparison between two publicly available datasets, namely Vid-QU-EX and QaTa-COV19-v2, to assess the robustness and versatility of MD-Net in segmenting COVID-19 lesions. The experimental results show that the proposed MD-Net has superior performance compared to its competitors, and it exhibits higher scores on the Dice value, Matthews correlation coefficient (Mcc), and Jaccard index. In addition, we also conducted ablation studies on the Vid-QU-EX dataset to evaluate the contributions of each key component within the proposed architecture.

Keywords: COVID-19; Dense decoder aggregation; Multi-scale input layers; SE-Conv; Segmentation; U-Net.

MeSH terms

Algorithms
COVID-19* / diagnostic imaging
COVID-19* / virology
Deep Learning*
Humans
Image Processing, Computer-Assisted / methods
Neural Networks, Computer
SARS-CoV-2*
Tomography, X-Ray Computed* / methods