SigFuge: single gene clustering of RNA-seq reveals differential isoform usage among cancer samples

Nucleic Acids Res. 2014 Aug;42(14):e113. doi: 10.1093/nar/gku521. Epub 2014 Jul 16.

Abstract

High-throughput sequencing technologies, including RNA-seq, have made it possible to move beyond gene expression analysis to study transcriptional events including alternative splicing and gene fusions. Furthermore, recent studies in cancer have suggested the importance of identifying transcriptionally altered loci as biomarkers for improved prognosis and therapy. While many statistical methods have been proposed for identifying novel transcriptional events with RNA-seq, nearly all rely on contrasting known classes of samples, such as tumor and normal. Few tools exist for the unsupervised discovery of such events without class labels. In this paper, we present SigFuge for identifying genomic loci exhibiting differential transcription patterns across many RNA-seq samples. SigFuge combines clustering with hypothesis testing to identify genes exhibiting alternative splicing, or differences in isoform expression. We apply SigFuge to RNA-seq cohorts of 177 lung and 279 head and neck squamous cell carcinoma samples from The Cancer Genome Atlas, and identify several cases of differential isoform usage including CDKN2A, a tumor suppressor gene known to be inactivated in a majority of lung squamous cell tumors. By not restricting attention to known sample stratifications, SigFuge offers a novel approach to unsupervised screening of genetic loci across RNA-seq cohorts. SigFuge is available as an R package through Bioconductor.
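
The general idea of pairing clustering with a hypothesis test at a single locus can be illustrated with a toy sketch. The abstract does not specify the clustering algorithm, the normalization, or the test statistic, so everything below is an assumption made for illustration: per-base coverage is log-transformed and centered per sample, samples are split with 2-means, and the split is scored by a cluster index (within-cluster over total sum of squares) compared against a Monte Carlo null of independent Gaussians per position. All function names are hypothetical, and this is not the SigFuge implementation or its R API.

```python
import math
import random


def normalize(coverage):
    """Log-transform and centre one sample's per-base coverage (assumed step)."""
    logged = [math.log2(c + 1.0) for c in coverage]
    mean = sum(logged) / len(logged)
    return [v - mean for v in logged]


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(rows):
    return [sum(col) / len(rows) for col in zip(*rows)]


def two_means(rows, max_iter=100):
    """Lloyd's algorithm with k=2, seeded by row 0 and the row farthest from it."""
    far = max(range(len(rows)), key=lambda i: sq_dist(rows[0], rows[i]))
    centers = [list(rows[0]), list(rows[far])]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            0 if sq_dist(r, centers[0]) <= sq_dist(r, centers[1]) else 1
            for r in rows
        ]
        if new_labels == labels:
            break
        labels = new_labels
        for k in (0, 1):
            members = [r for r, lab in zip(rows, labels) if lab == k]
            if members:  # keep the previous centre if a cluster empties
                centers[k] = centroid(members)
    return labels


def cluster_index(rows, labels):
    """Within-cluster sum of squares divided by total sum of squares."""
    grand = centroid(rows)
    total = sum(sq_dist(r, grand) for r in rows)
    if total == 0:
        return 1.0  # no variation at all, so no evidence of two clusters
    within = 0.0
    for k in (0, 1):
        members = [r for r, lab in zip(rows, labels) if lab == k]
        if members:
            c = centroid(members)
            within += sum(sq_dist(r, c) for r in members)
    return within / total


def clustering_p_value(rows, n_sim=200, seed=0):
    """Monte Carlo p-value for the 2-means split.

    Simplified null (an assumption): each position is an independent
    Gaussian with the observed per-position mean and standard deviation.
    """
    rng = random.Random(seed)
    labels = two_means(rows)
    observed = cluster_index(rows, labels)
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    sds = [
        math.sqrt(sum((v - m) ** 2 for v in c) / len(c))
        for c, m in zip(cols, means)
    ]
    hits = 0
    for _ in range(n_sim):
        sim = [[rng.gauss(m, s) for m, s in zip(means, sds)] for _ in rows]
        if cluster_index(sim, two_means(sim)) <= observed:
            hits += 1
    # The +1 terms keep the estimate away from an exact zero
    return labels, observed, (hits + 1) / (n_sim + 1)
```

In this sketch, a gene whose samples fall into two groups with different coverage shapes (for example, one group lacking reads over a block of positions) yields a low cluster index and a small p-value, while a homogeneous cohort does not. Repeating the procedure per gene would give an unsupervised screen of the kind the abstract describes, although multiple-testing correction would then be needed.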

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Carcinoma, Squamous Cell / genetics
  • Carrier Proteins / genetics
  • Cluster Analysis
  • Exons
  • Gene Expression Profiling / methods*
  • Genes, p16
  • Genetic Loci
  • Head and Neck Neoplasms / genetics
  • High-Throughput Nucleotide Sequencing / methods*
  • Intracellular Signaling Peptides and Proteins
  • Kallikreins / genetics
  • Lung Neoplasms / genetics
  • Neoplasms / genetics*
  • Nuclear Proteins
  • RNA Isoforms / metabolism*
  • Sequence Analysis, RNA / methods*
  • Software*
  • Squamous Cell Carcinoma of Head and Neck

Substances

  • Carrier Proteins
  • Intracellular Signaling Peptides and Proteins
  • Nuclear Proteins
  • PIMREG protein, human
  • RNA Isoforms
  • KLK12 protein, human
  • Kallikreins