Meta-analysis approaches to combine multiple gene set enrichment studies

Stat Med. 2018 Feb 20;37(4):659-672. doi: 10.1002/sim.7540. Epub 2017 Oct 19.

Abstract

In the field of gene set enrichment analysis (GSEA), meta-analysis has been used to integrate information from multiple studies to present a reliable summarization of the expanding volume of individual biomedical research, as well as improve the power of detecting essential gene sets involved in complex human diseases. However, existing methods, Meta-Analysis for Pathway Enrichment (MAPE), may be subject to power loss because of (1) using gross summary statistics for combining end results from component studies and (2) using enrichment scores whose distributions depend on the set sizes. In this paper, we adapt meta-analysis approaches recently developed for genome-wide association studies, which are based on fixed effect and random effects (RE) models, to integrate multiple GSEA studies. We further develop a mixed strategy via adaptive testing for choosing RE versus FE models to achieve greater statistical efficiency as well as flexibility. In addition, a size-adjusted enrichment score based on a one-sided Kolmogorov-Smirnov statistic is proposed to formally account for varying set sizes when testing multiple gene sets. Our methods tend to have much better performance than the MAPE methods and can be applied to both discrete and continuous phenotypes. Specifically, the performance of the adaptive testing method seems to be the most stable in general situations.

Keywords: GSEA; MAPE; adjusted Kolmogorov-Smirnov statistic; between-study heterogeneity; fixed effect; generalized linear model; integrative GSEA; random effects.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Biostatistics
  • Computer Simulation
  • Gene Expression Profiling / statistics & numerical data
  • Gene Regulatory Networks*
  • Genome-Wide Association Study / statistics & numerical data
  • Humans
  • Linear Models
  • Lung Neoplasms / genetics
  • Meta-Analysis as Topic*
  • Models, Genetic
  • Models, Statistical
  • ROC Curve