Automated cooling tower detection through deep learning for Legionnaires' disease outbreak investigations: a model development and validation study

Lancet Digit Health. 2024 Jul;6(7):e500-e506. doi: 10.1016/S2589-7500(24)00094-3.

Abstract

Background: Cooling towers containing Legionella spp are a high-risk source of Legionnaires' disease outbreaks. Manually locating cooling towers from aerial imagery during outbreak investigations requires expertise, is labour intensive, and can be prone to errors. We aimed to train a deep learning computer vision model to automatically detect cooling towers that are aerially visible.

Methods: Between Jan 1 and 31, 2021, we extracted satellite view images of Philadelphia (PA, USA) and New York City (NY, USA) from Google Maps and annotated cooling towers to create training datasets. We augmented training data with synthetic data and model-assisted labelling of additional cities. Using 2051 images containing 7292 cooling towers, we trained a two-stage model using YOLOv5, a model that detects objects in images, and EfficientNet-b5, a model that classifies images. We assessed the primary outcomes of sensitivity and positive predictive value (PPV) of the model against manual labelling on test datasets of 548 images, including from two cities not seen in training (Boston [MA, USA] and Athens [GA, USA]). We compared the search speed of the model with that of manual searching by four epidemiologists.
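
The abstract does not state how the two stages are combined. One plausible reading, offered here only as a minimal sketch, is that the object detector proposes candidate boxes and the image classifier then re-scores a crop of each candidate, discarding likely false positives. In the Python below, `two_stage_detect`, the `detect` and `classify` callables, both thresholds, and the demo stubs are hypothetical placeholders rather than the authors' code or the YOLOv5 or EfficientNet APIs.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]          # (x_min, y_min, x_max, y_max) in pixels
Image = Sequence[Sequence[float]]        # toy greyscale image as rows of pixels


@dataclass
class Detection:
    box: Box
    detector_score: float
    classifier_score: float


def two_stage_detect(
    image: Image,
    detect: Callable[[Image], List[Tuple[Box, float]]],
    classify: Callable[[Image], float],
    detector_threshold: float = 0.25,    # assumed value, not from the paper
    classifier_threshold: float = 0.5,   # assumed value, not from the paper
) -> List[Detection]:
    """Keep candidate boxes that pass both the detector and the classifier."""
    kept: List[Detection] = []
    for box, det_score in detect(image):
        if det_score < detector_threshold:
            continue                     # stage 1: drop weak candidates
        x0, y0, x1, y1 = box
        crop = [row[x0:x1] for row in image[y0:y1]]
        cls_score = classify(crop)       # stage 2: re-score the cropped candidate
        if cls_score >= classifier_threshold:
            kept.append(Detection(box, det_score, cls_score))
    return kept


# Stand-in stubs so the sketch runs without any deep learning library.
def demo_detect(image: Image) -> List[Tuple[Box, float]]:
    return [((0, 0, 2, 2), 0.9), ((2, 2, 4, 4), 0.6), ((0, 2, 2, 4), 0.1)]


def demo_classify(crop: Image) -> float:
    pixels = [p for row in crop for p in row]
    return sum(pixels) / len(pixels) if pixels else 0.0


demo_image = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
demo_kept = two_stage_detect(demo_image, demo_detect, demo_classify)
```

In this toy run, the second candidate passes the detector threshold but is rejected by the classifier stub, which illustrates how a second stage could raise PPV at some possible cost to sensitivity.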

Findings: The model identified visible cooling towers with 95·1% sensitivity (95% CI 94·0-96·1) and a PPV of 90·1% (95% CI 90·0-90·2) in New York City and Philadelphia. In Boston, sensitivity was 91·6% (89·2-93·7) and PPV was 80·8% (80·5-81·2). In Athens, sensitivity was 86·9% (75·8-94·2) and PPV was 85·5% (84·2-86·7). For an area of New York City encompassing 45 blocks (0·26 square miles), the model searched more than 600 times faster (7·6 s; 351 potential cooling towers identified) than did human investigators (mean 83·75 min [SD 29·5]; mean 310·8 cooling towers [42·2]).
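
As a reading aid, the two primary outcomes and the speed comparison can be written out as simple ratios. The sketch below uses the standard definitions of sensitivity and PPV. The true-positive, false-negative, and false-positive counts in the example are hypothetical, because the abstract reports only percentages. The timing inputs (mean 83·75 min manual search, 7·6 s model search) are taken from the Findings above.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Share of manually labelled cooling towers that the model also found."""
    if true_pos + false_neg == 0:
        raise ValueError("no labelled cooling towers to evaluate against")
    return true_pos / (true_pos + false_neg)


def ppv(true_pos: int, false_pos: int) -> float:
    """Share of model detections that were confirmed as cooling towers."""
    if true_pos + false_pos == 0:
        raise ValueError("the model made no detections")
    return true_pos / (true_pos + false_pos)


def speedup(manual_minutes: float, model_seconds: float) -> float:
    """How many times faster the model search was than the manual search."""
    return manual_minutes * 60.0 / model_seconds


# Hypothetical counts, for illustration only (not reported in the abstract).
example_sensitivity = sensitivity(true_pos=90, false_neg=10)
example_ppv = ppv(true_pos=90, false_pos=10)

# Timings reported in the Findings: mean 83.75 min manual vs 7.6 s model.
reported_speedup = speedup(manual_minutes=83.75, model_seconds=7.6)
```

With the reported timings, the ratio works out to roughly 660, consistent with the statement that the model searched more than 600 times faster.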

Interpretation: The model could be used to accelerate investigation and source control during outbreaks of Legionnaires' disease through the identification of cooling towers from aerial imagery, potentially preventing additional disease spread. The model has already been used by public health teams for outbreak investigations and to initialise cooling tower registries, which are considered best practice for preventing and responding to outbreaks of Legionnaires' disease.

Funding: None.

Publication types

  • Validation Study

MeSH terms

  • Air Conditioning
  • Deep Learning*
  • Disease Outbreaks* / prevention & control
  • Humans
  • Legionella
  • Legionnaires' Disease* / diagnosis
  • Legionnaires' Disease* / epidemiology
  • Legionnaires' Disease* / prevention & control
  • New York / epidemiology
  • Philadelphia / epidemiology
  • Satellite Imagery