Gene-set association tests for next-generation sequencing data

Bioinformatics. 2016 Sep 1;32(17):i611-i619. doi: 10.1093/bioinformatics/btw429.

Abstract

Motivation: Recently, many methods have been developed for conducting rare-variant association studies for sequencing data. These methods have primarily been based on gene-level associations but have not been proven to be as effective as expected. Gene-set-level tests have shown great advantages over gene-level tests in terms of power and robustness, because complex diseases are often caused by multiple genes that comprise of biological gene sets.

Results: Here, we propose several novel gene-set tests that employ rapid and efficient dimensionality reduction. The performance of these tests was investigated using extensive simulations and application to 1058 whole-exome sequences from a Korean population. We identified some known pathways and novel pathways whose rare or common variants are associated with elevated liver enzymes and replicated the results in an independent cohort.

Availability and implementation: Source R code for our algorithm is freely available at http://statgen.snu.ac.kr/software/QTest

Contact: tspark@stats.snu.ac.kr

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Algorithms*
  • Datasets as Topic
  • Exome
  • Genetic Association Studies
  • Genetic Testing
  • Genetic Variation
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Korea
  • Metabolic Networks and Pathways