Evaluation of association tests for rare variants using simulated data sets in the Genetic Analysis Workshop 17 data

BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S86. doi: 10.1186/1753-6561-5-S9-S86. eCollection 2011.

Abstract

We evaluate four association tests for rare variants-the combined multivariate and collapsing (CMC) method, two weighted-sum methods, and a variable threshold method-by applying them to the simulated data sets of unrelated individuals in the Genetic Analysis Workshop 17 (GAW17) data. The family-wise error rate (FWER) and average power are used as criteria for evaluation. Our results show that when all nonsynonymous SNPs (rare variants and common variants) in a gene are jointly analyzed, the CMC method fails to control the FWER; when only rare variants (single-nucleotide polymorphisms with minor allele frequency less than 0.05) are analyzed, all four methods can control FWER well. All four methods have comparable power, which is low for the analysis of the GAW17 data sets. Three of the methods (not including the CMC method) involve estimation of p-values using permutation procedures that either can be computationally intensive or generate inflated FWERs. We adapt a fast permutation procedure into these three methods. The results show that using the fast permutation procedure can produce FWERs and average powers close to the values obtained from the standard permutation procedure on the GAW17 data sets. The standard permutation procedure is computationally intensive.