Exploring data from genetic association studies using Bayesian variable selection and the Dirichlet process: application to searching for gene × gene patterns

Genet Epidemiol. 2012 Sep;36(6):663-74. doi: 10.1002/gepi.21661. Epub 2012 Jul 31.

Abstract

We construct data exploration tools for recognizing important covariate patterns associated with a phenotype, with particular focus on searching for association with gene-gene patterns. To this end, we propose a new variable selection procedure that employs latent selection weights and compare it to an alternative formulation. The selection procedures are implemented in tandem with a Dirichlet process mixture model for the flexible clustering of genetic and epidemiological profiles. We illustrate our approach with the aid of simulated data and the analysis of a real data set from a genome-wide association study.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem*
  • Cluster Analysis
  • Computer Simulation
  • Genetic Association Studies / methods*
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study
  • Humans
  • Lung Neoplasms / genetics
  • Models, Genetic*
  • Models, Statistical
  • Phenotype
  • Polymorphism, Single Nucleotide