Inference of tumor phylogenies with improved somatic mutation discovery

J Comput Biol. 2013 Nov;20(11):933-44. doi: 10.1089/cmb.2013.0106.

Abstract

Next-generation sequencing technologies provide a powerful tool for studying genome evolution during progression of advanced diseases such as cancer. Although many recent studies have employed new sequencing technologies to detect mutations across multiple, genetically related tumors, current methods do not exploit available phylogenetic information to improve the accuracy of their variant calls. Here, we present a novel algorithm that uses somatic single-nucleotide variations (SNVs) in multiple, related tissue samples as lineage markers for phylogenetic tree reconstruction. Our method then leverages the inferred phylogeny to improve the accuracy of SNV discovery. Experimental analyses demonstrate that our method improves the accuracy of somatic SNV calling across multiple, related samples by up to 32% over GATK's Unified Genotyper, the state-of-the-art multisample SNV caller.
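
The abstract does not spell out the tree model, so the following is only an illustrative sketch of the general idea of treating SNVs as lineage markers, not the authors' algorithm. It assumes binary present/absent calls per sample and a rooted tree in which each mutation arises once and is never lost, so that the sets of samples carrying any two SNVs must be nested or disjoint. All names and data here (`calls`, `carriers`, `compatible`, `conflicting_pairs`, the sample labels) are hypothetical.

```python
from itertools import combinations

def carriers(calls, snv):
    """Return the set of samples in which the SNV was called present."""
    return {sample for sample, present in calls[snv].items() if present}

def compatible(a, b):
    """Assumed tree model: two SNVs fit one rooted tree only if their
    carrier sets are nested or disjoint."""
    return a <= b or b <= a or not (a & b)

def conflicting_pairs(calls):
    """List SNV pairs whose calls cannot both sit on a single tree."""
    sets = {snv: carriers(calls, snv) for snv in calls}
    return [(x, y) for x, y in combinations(sorted(sets), 2)
            if not compatible(sets[x], sets[y])]

# Toy calls for three related tumor samples (made-up data).
calls = {
    "snv1": {"T1": 1, "T2": 1, "T3": 1},
    "snv2": {"T1": 1, "T2": 1, "T3": 0},
    "snv3": {"T1": 0, "T2": 1, "T3": 1},
    "snv4": {"T1": 0, "T2": 0, "T3": 1},
}

print(conflicting_pairs(calls))
```

Under this assumption, a pair reported as conflicting suggests that at least one of the underlying calls is a false positive or a false negative, which is the kind of signal a phylogeny-aware caller could use to revisit individual calls.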

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computer Simulation
  • DNA Mutational Analysis*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Models, Genetic
  • Mutation
  • Neoplasms / genetics*
  • Phylogeny
  • Polymorphism, Single Nucleotide*