Mixture models for linkage analysis of affected sibling pairs and covariates

Genet Epidemiol. 2002 Jan;22(1):52-65. doi: 10.1002/gepi.1043.

Abstract

To determine the genetic etiology of complex diseases, a common study design is to recruit affected sib/relative pairs (ASP/ARP) and evaluate their genome-wide distribution of identical by descent (IBD) sharing using a set of highly polymorphic markers. Other attributes or environmental exposures of the ASP/ARP, which are thought to affect liability to disease, are sometimes collected. Conceivably, these covariates could refine the linkage analysis. Most published methods for ASP/ARP linkage with covariates can be conceptualized as logistic models in which IBD status of the ASP is predicted by pair-specific covariates. We develop a different approach to the problem of ASP analysis in the presence of covariates, one that extends naturally to ARP under certain conditions. For ASP linkage analysis, we formulate a mixture model in which a disease mutation is segregating in only a fraction alpha of the sibships, with the remaining fraction 1 - alpha of sibships being unlinked. Covariate information is used to predict membership within groups; in this report, the two groups correspond to the linked and unlinked sibships. For an ASP with covariate(s) Z = z and multilocus genotype X = x, the mixture model is alpha(z) g(x; lambda) + [1 - alpha(z)] g_0(x), in which g_0(x) follows the distribution of genotypes under the null IBD distribution and g(x; lambda) allows for increased IBD sharing. Two mixture models are developed. The pre-clustering model uses covariate information to form probabilistic clusters and then tests for excess IBD sharing independent of the covariates. The Cov-IBD model determines probabilistic group membership by joint consideration of covariate and IBD values. Simulations show that incorporating covariates into linkage analysis can enhance power substantially.
A feature of our conceptualization of ASP linkage analysis with covariates is that it makes apparent how data analysis might evaluate covariates prior to the linkage analysis, thus avoiding the loss of power described by Leal and Ott [2000] when data are stratified.
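
The mixture density stated in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how alpha(z) depends on the covariate, so the logistic link and its coefficients b0 and b1 are assumptions made here for illustration only. The genotype likelihoods g(x; lambda) and g_0(x) are taken as precomputed numbers per sib pair, and all function names are hypothetical. The posterior membership probability is simply Bayes' rule applied to the two-component mixture.

```python
import math


def alpha(z, b0, b1):
    """Probability that a sibship with covariate z is linked.

    ASSUMPTION: a logistic link with hypothetical coefficients b0, b1.
    The abstract only says that covariates predict group membership.
    """
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * z)))


def mixture_density(a, g_linked, g_null):
    """alpha(z) * g(x; lambda) + [1 - alpha(z)] * g_0(x) for one sib pair."""
    return a * g_linked + (1.0 - a) * g_null


def posterior_linked(a, g_linked, g_null):
    """Posterior probability of the linked group, by Bayes' rule."""
    return a * g_linked / mixture_density(a, g_linked, g_null)


def log_likelihood(pairs, b0, b1):
    """Sum of log mixture densities over sib pairs.

    Each element of `pairs` is (z, g_linked, g_null), where g_linked and
    g_null are the pair's genotype likelihoods under linkage and under the
    null IBD distribution (assumed to be computed elsewhere).
    """
    total = 0.0
    for z, g_linked, g_null in pairs:
        total += math.log(mixture_density(alpha(z, b0, b1), g_linked, g_null))
    return total
```

When g(x; lambda) equals g_0(x) for a pair, the genotype data carry no linkage information and the posterior membership probability reduces to alpha(z), so in this sketch the covariate alone determines group membership.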

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Chi-Square Distribution
  • Computer Simulation
  • Genetic Linkage*
  • Genetic Markers
  • Humans
  • Likelihood Functions
  • Models, Genetic*
  • Nuclear Family
  • Research Design

Substances

  • Genetic Markers