Identification of a High-Frequency Intrahost SARS-CoV-2 Spike Variant with Enhanced Cytopathic and Fusogenic Effects

mBio. 2021 Jun 29;12(3):e0078821. doi: 10.1128/mBio.00788-21. Epub 2021 Jun 29.

Abstract

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a virus that is continuously evolving. Although its RNA-dependent RNA polymerase exhibits some exonuclease proofreading activity, viral sequence diversity can be produced by replication errors and host factors. A diversity of genetic variants can be observed in the intrahost viral population structure of infected individuals. Most mutations will follow a neutral molecular evolution and will not make significant contributions to variations within and between infected hosts. Herein, we profiled the intrasample genetic diversity of SARS-CoV-2 variants, also known as quasispecies, using high-throughput sequencing data sets from 15,289 infected individuals and infected cell lines. Despite high mutational background, we identified recurrent intragenetic variable positions in the samples analyzed, including several positions at the end of the gene encoding the viral spike (S) protein. Strikingly, we observed a high frequency of C→A missense mutations resulting in the S protein lacking the last 20 amino acids (SΔ20). We found that this truncated S protein undergoes increased processing and increased syncytium formation, presumably due to escaping M protein retention in intracellular compartments. Our findings suggest the emergence of a high-frequency viral sublineage that is not horizontally transmitted but potentially involved in intrahost disease cytopathic effects. IMPORTANCE The mutation rate and evolution of RNA viruses correlate with viral adaptation. While most mutations do not make significant contributions to viral molecular evolution, some are naturally selected and produce variants through positive selection. Many SARS-CoV-2 variants have been recently described and show phenotypic selection toward more infectious viruses. Our study describes another type of variant that does not contribute to interhost heterogeneity but rather phenotypic selection toward variants that might have increased cytopathic effects. We identified that a C-terminal truncation of the spike protein removes an important endoplasmic reticulum (ER) retention signal, which consequently results in a spike variant that easily travels through the Golgi complex toward the plasma membrane in a preactivated conformation, leading to increased syncytium formation.

Keywords: COVID-19; SARS-CoV-2; genetic variants; high-throughput sequencing; spike protein; syncytia.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • COVID-19 / pathology*
  • Cell Line
  • Evolution, Molecular
  • Genetic Variation / genetics
  • Genome, Viral / genetics*
  • Giant Cells / virology
  • HEK293 Cells
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Mutation Rate
  • SARS-CoV-2 / genetics*
  • SARS-CoV-2 / immunology
  • Spike Glycoprotein, Coronavirus / genetics*
  • Spike Glycoprotein, Coronavirus / immunology

Substances

  • Spike Glycoprotein, Coronavirus
  • spike protein, SARS-CoV-2