VODKA2: A fast and accurate method to detect non-standard viral genomes from large RNA-seq datasets

bioRxiv [Preprint]. 2023 Jul 15:2023.04.25.537842. doi: 10.1101/2023.04.25.537842.

Abstract

During viral replication, viruses carrying an RNA genome produce non-standard viral genomes (nsVGs), including copy-back viral genomes (cbVGs) and deletion viral genomes (delVGs), that play a crucial role in regulating viral replication and pathogenesis. Because of their critical roles in determining the outcome of RNA virus infections, the study of nsVGs has flourished in recent years exposing a need for bioinformatic tools that can accurately identify them within Next-Generation Sequencing data obtained from infected samples. Here, we present our data analysis pipeline, Viral Opensource DVG Key Algorithm2 (VODKA2), that is optimized to run on a High Performance Computing (HPC) environment for fast and accurate detection of nsVGs from large data sets.

Publication types

  • Preprint