Mining associations between genetic markers, phenotypes, and covariates

Genet Epidemiol. 2001:21 Suppl 1:S588-93. doi: 10.1002/gepi.2001.21.s1.s588.

Abstract

We used Haplotype Pattern Mining, HPM [Toivonen et al., Am J Hum Genet 67:133-45, 2000], for gene localization in Genetic Analysis Workshop (GAW) 12 isolate data. In HPM, association is analyzed by searching all trait-associated haplotype patterns. Data mining algorithms are utilized to make the search efficient. The strength of the haplotype-trait associations is measured by a linear model, into which a pre-seelected set of covariates is incorporated. Marker-wise patterns of association are used for predicting the disease gene location. Genome-wide scans of susceptibility genes for affection status as well as for the quantitative traits (Q1-Q5) were performed. First analyses were made with small sample sizes, 63-94 trios per trait, which is compared with a pilot study of a larger complex disease-mapping project. Subsequently, the analysis was repeated with approximately 600 cases and 600 controls per trait to give higher power to the analyses. With small sample sizes, only the susceptibility genes having the strongest effects on the traits could be localized. The larger sample size gave very good results: all susceptibility genes, except one, could be correctly localized. First experiments on candidate genes suggested that HPM is applicable even to fine mapping of mutations in DNA sequence.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Analysis of Variance
  • Chromosome Mapping / statistics & numerical data*
  • Chromosomes, Human, Pair 10
  • Chromosomes, Human, Pair 6
  • Genetic Markers / genetics*
  • Genetic Predisposition to Disease / genetics
  • Haplotypes / genetics*
  • Humans
  • Mathematical Computing
  • Models, Genetic*
  • Phenotype*
  • Quantitative Trait, Heritable
  • Software

Substances

  • Genetic Markers