Long-Read-Based Genome Assembly Reveals Numerous Endogenous Viral Elements in the Green Algal Bacterivore Cymbomonas tetramitiformis

Genome Biol Evol. 2023 Nov 1;15(11):evad194. doi: 10.1093/gbe/evad194.

Abstract

The marine tetraflagellate Cymbomonas tetramitiformis has drawn attention as an early diverging green alga that uses a phago-mixotrophic mode of nutrition (i.e., the ability to derive nourishment from both photosynthesis and bacterial prey). The Cymbomonas nuclear genome was sequenced previously, but because only short-read (Illumina) data were used, the assembly was missing a large proportion of the genome's repeat regions. For this study, we generated Oxford Nanopore long-read and additional short-read Illumina data and performed a hybrid assembly that significantly improved the total assembly size and contiguity. Numerous endogenous viral elements were identified in the repeat regions of the new assembly. These include the complete genome of a giant Algavirales virus along with many genomes of integrated Polinton-like viruses (PLVs) from two groups: Gezel-like PLVs and a novel group of prasinophyte-specific PLVs. The integrated ∼400 kb genome of the giant Algavirales virus provides the first account of an association between the uncultured viral family AG_03 and green algae. The complete PLV genomes from C. tetramitiformis ranged from 15 to 25 kb in length and showed a diverse gene content. In addition, heliorhodopsin gene-containing repeat elements of putative mirusvirus origin were identified. These results illustrate multiple past (and possibly ongoing) alga-virus interactions that accompanied the genome evolution of C. tetramitiformis.

Keywords: MCP; MinION; NCLDV; mixotroph; phagotroph; polinton; prasinophyte.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Chlorophyta* / genetics
  • Genome
  • Genome, Viral
  • High-Throughput Nucleotide Sequencing / methods
  • Photosynthesis
  • Sequence Analysis, DNA / methods
  • Viruses* / genetics