Comparative Pathogenomics of Escherichia coli: Polyvalent Vaccine Target Identification through Virulome Analysis

Infect Immun. 2021 Jul 15;89(8):e0011521. doi: 10.1128/IAI.00115-21. Epub 2021 Jul 15.

Abstract

Comparative genomics of bacterial pathogens has been useful for revealing potential virulence factors. Escherichia coli is a significant cause of human morbidity and mortality worldwide but can also exist as a commensal in the human gastrointestinal tract. With many sequenced genomes, it has served as a model organism for comparative genomic studies to understand the link between genetic content and potential for virulence. To date, however, no comprehensive analysis of its complete "virulome" has been performed for the purpose of identifying universal or pathotype-specific targets for vaccine development. Here, we describe the construction of a pathotype database of 107 well-characterized, completely sequenced pathogenic and nonpathogenic E. coli strains, which we annotated for major virulence factors (VFs). The data were cross-referenced against pathotype, phylogroup, and sequence type to identify patterns, and the results were verified against all 1,348 complete E. coli chromosomes in the NCBI RefSeq database. Our results demonstrate that phylogroup drives many of the "pathotype-associated" VFs, and ExPEC-associated VFs are found predominantly within the B2/D/F/G phylogenetic clade, suggesting that these phylogroups are better adapted to infect human hosts. Finally, we used this information to propose polyvalent vaccine targets with specificity toward extraintestinal strains, targeting key invasive strategies, including immune evasion (group 2 capsule), iron acquisition (FyuA, IutA, and Sit), adherence (SinH, Afa, Pap, Sfa, and Iha), and toxins (Usp, Sat, Vat, Cdt, Cnf1, and HlyA). While many of these targets have been proposed before, this work is the first to examine their pathotype and phylogroup distribution and how they may be targeted together to prevent disease.
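
The cross-referencing of VF annotations against phylogroup described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the strain records, the function names (`vf_prevalence_by_phylogroup`, `candidate_targets`), and the thresholds are all hypothetical, and the toy values are invented purely for illustration rather than taken from the study. The sketch tabulates, for each phylogroup, the fraction of strains carrying each VF, then keeps VFs that are common in a chosen set of phylogroups and rare elsewhere.

```python
from collections import defaultdict

# Hypothetical toy records: (strain name, phylogroup, VF genes detected).
# These values are invented for illustration and are NOT data from the study.
STRAINS = [
    ("strain_1", "B2", {"fyuA", "iutA", "usp"}),
    ("strain_2", "B2", {"fyuA", "usp"}),
    ("strain_3", "D", {"fyuA", "iutA"}),
    ("strain_4", "A", set()),
    ("strain_5", "A", {"iutA"}),
    ("strain_6", "B1", set()),
]


def vf_prevalence_by_phylogroup(strains):
    """Return {phylogroup: {vf: fraction of strains in that phylogroup with vf}}."""
    totals = defaultdict(int)
    counts = defaultdict(lambda: defaultdict(int))
    for _name, phylogroup, vfs in strains:
        totals[phylogroup] += 1
        for vf in vfs:
            counts[phylogroup][vf] += 1
    all_vfs = sorted({vf for _name, _pg, vfs in strains for vf in vfs})
    return {
        pg: {vf: counts[pg][vf] / totals[pg] for vf in all_vfs}
        for pg in sorted(totals)
    }


def candidate_targets(prevalence, target_groups, min_in=0.5, max_out=0.25):
    """List VFs with prevalence >= min_in in every target phylogroup
    and <= max_out in every other phylogroup (assumed, arbitrary cutoffs)."""
    vfs = sorted({vf for row in prevalence.values() for vf in row})
    selected = []
    for vf in vfs:
        inside = [prevalence[pg][vf] for pg in prevalence if pg in target_groups]
        outside = [prevalence[pg][vf] for pg in prevalence if pg not in target_groups]
        if inside and all(p >= min_in for p in inside) and all(p <= max_out for p in outside):
            selected.append(vf)
    return selected


if __name__ == "__main__":
    prevalence = vf_prevalence_by_phylogroup(STRAINS)
    for phylogroup, row in prevalence.items():
        print(phylogroup, row)
    print(candidate_targets(prevalence, {"B2", "D"}))
```

A real analysis would replace the toy records with annotated genomes and would likely need a statistical test rather than fixed cutoffs; the fixed thresholds here only keep the example short.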

Keywords: Escherichia coli; ExPEC; InPEC; comparative genomics; enteric pathogens; genomics; pathogenesis; pathogenomics; vaccine development; vaccines; virulence factors.

MeSH terms

  • Animals
  • Bacterial Vaccines / immunology
  • Escherichia coli / genetics*
  • Escherichia coli / pathogenicity
  • Escherichia coli Infections / immunology
  • Escherichia coli Infections / microbiology*
  • Escherichia coli Infections / prevention & control
  • Escherichia coli Proteins / genetics
  • Extraintestinal Pathogenic Escherichia coli / genetics
  • Genes, Bacterial
  • Genome, Bacterial*
  • Genomics*
  • Humans
  • Vaccines, Combined / immunology
  • Virulence / genetics
  • Virulence Factors / genetics

Substances

  • Bacterial Vaccines
  • Escherichia coli Proteins
  • Vaccines, Combined
  • Virulence Factors