MutComFocal: an integrative approach to identifying recurrent and focal genomic alterations in tumor samples

BMC Syst Biol. 2013 Mar 25:7:25. doi: 10.1186/1752-0509-7-25.

Abstract

Background: Most tumors are the result of accumulated genomic alterations in somatic cells. The emerging spectrum of alterations in tumors is complex and the identification of relevant genes and pathways remains a challenge. Furthermore, key cancer genes are usually found amplified or deleted in chromosomal regions containing many other genes. Point mutations, on the other hand, provide exquisite information about amino acid changes that could be implicated in the oncogenic process. Current large-scale genomic projects provide high throughput genomic data in a large number of well-characterized tumor samples.

Methods: We define a Bayesian approach designed to identify candidate cancer genes by integrating copy number and point mutation information. Our method exploits the concept that small and recurrent alterations in tumors are more informative in the search for cancer genes. Thus, the algorithm (Mutations with Common Focal Alterations, or MutComFocal) seeks focal copy number alterations and recurrent point mutations within high throughput data from large panels of tumor samples.

Results: We apply MutComFocal to Diffuse Large B-cell Lymphoma (DLBCL) data from four different high throughput studies, totaling 78 samples assessed for copy number alterations by single nucleotide polymorphism (SNP) array analysis and 65 samples assayed for protein changing point mutations by whole exome/whole transcriptome sequencing. In addition to recapitulating known alterations, MutComFocal identifies ARID1B, ROBO2 and MRS1 as candidate tumor suppressors and KLHL6, IL31 and LRP1 as putative oncogenes in DLBCL.

Conclusions: We present a Bayesian approach for the identification of candidate cancer genes by integrating data collected in large number of cancer patients, across different studies. When trained on a well-studied dataset, MutComFocal is able to identify most of the reported characterized alterations. The application of MutComFocal to large-scale cancer data provides the opportunity to pinpoint the key functional genomic alterations in tumors.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Carrier Proteins / genetics
  • DNA-Binding Proteins / genetics
  • Gene Dosage / genetics*
  • Genes, Neoplasm / genetics*
  • Genetic Association Studies / methods
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Interleukins / genetics
  • Low Density Lipoprotein Receptor-Related Protein-1 / genetics
  • Lymphoma, Large B-Cell, Diffuse / genetics*
  • Models, Genetic*
  • Point Mutation / genetics*
  • Receptors, Immunologic / genetics
  • Systems Biology / methods
  • Transcription Factors / genetics

Substances

  • ARID1B protein, human
  • Carrier Proteins
  • DNA-Binding Proteins
  • IL31 protein, human
  • Interleukins
  • KLHL6 protein, human
  • LRP1 protein, human
  • Low Density Lipoprotein Receptor-Related Protein-1
  • ROBO2 protein, human
  • Receptors, Immunologic
  • Transcription Factors