The SARS-CoV-2 subgenome landscape and its novel regulatory features

Mol Cell. 2021 May 20;81(10):2135-2147.e5. doi: 10.1016/j.molcel.2021.02.036. Epub 2021 Mar 3.

Abstract

Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is currently a global pandemic. CoVs are known to generate negative subgenomes (subgenomic RNAs [sgRNAs]) through transcription-regulating sequence (TRS)-dependent template switching, but the global dynamic landscapes of coronaviral subgenomes and regulatory rules remain unclear. Here, using next-generation sequencing (NGS) short-read and Nanopore long-read poly(A) RNA sequencing in two cell types at multiple time points after infection with SARS-CoV-2, we identified hundreds of template switches and constructed the dynamic landscapes of SARS-CoV-2 subgenomes. Interestingly, template switching could occur in a bidirectional manner, with diverse SARS-CoV-2 subgenomes generated from successive template-switching events. The majority of template switches result from RNA-RNA interactions, including seed and compensatory modes, with terminal pairing status as a key determinant. Two TRS-independent template switch modes are also responsible for subgenome biogenesis. Our findings reveal the subgenome landscape of SARS-CoV-2 and its regulatory features, providing a molecular basis for understanding subgenome biogenesis and developing novel anti-viral strategies.

Keywords: COVID-19; Nanopore sequencing; RNA pairing; SARS-CoV-2; biogenesis; coronavirus; sgRNA; subgenome; template switch.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • COVID-19* / genetics
  • COVID-19* / metabolism
  • Caco-2 Cells
  • Chlorocebus aethiops
  • Genome, Viral*
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • RNA, Viral* / genetics
  • RNA, Viral* / metabolism
  • SARS-CoV-2* / genetics
  • SARS-CoV-2* / metabolism
  • Vero Cells

Substances

  • RNA, Viral