Anaconda: AN automated pipeline for somatic COpy Number variation Detection and Annotation from tumor exome sequencing data

BMC Bioinformatics. 2017 Oct 3;18(1):436. doi: 10.1186/s12859-017-1833-3.

Abstract

Background: Copy number variations (CNVs) are the main genetic structural variations in cancer genome. Detecting CNVs in genetic exome region is efficient and cost-effective in identifying cancer associated genes. Many tools had been developed accordingly and yet these tools lack of reliability because of high false negative rate, which is intrinsically caused by genome exonic bias.

Results: To provide an alternative option, here, we report Anaconda, a comprehensive pipeline that allows flexible integration of multiple CNV-calling methods and systematic annotation of CNVs in analyzing WES data. Just by one command, Anaconda can generate CNV detection result by up to four CNV detecting tools. Associated with comprehensive annotation analysis of genes involved in shared CNV regions, Anaconda is able to deliver a more reliable and useful report in assistance with CNV-associate cancer researches.

Conclusion: Anaconda package and manual can be freely accessed at http://mcg.ustc.edu.cn/bsc/ANACONDA/ .

Keywords: Cancer; Copy number variation; Exome sequencing; Functional analysis.

MeSH terms

  • Algorithms*
  • Automation
  • DNA Copy Number Variations / genetics*
  • Databases, Genetic*
  • Exome / genetics*
  • Exome Sequencing*
  • Exons / genetics
  • Humans
  • Molecular Sequence Annotation*
  • Neoplasms / genetics*
  • Reproducibility of Results