Quantitative trait prediction based on genetic marker-array data, a simulation study

Bioinformatics. 2011 Mar 15;27(6):745-8. doi: 10.1093/bioinformatics/btr024. Epub 2011 Jan 31.

Abstract

Using simulation studies for quantitative trait loci (QTL), we evaluate the prediction quality of regression models whose covariates include single-nucleotide polymorphism (SNP) genetic markers that did not achieve genome-wide significance in the original genome-wide association study but were among the SNPs with the smallest P-values for the selected association test. We compare these models with the standard approach, which includes only SNPs that achieve genome-wide significance. Using mean square prediction error as the model metric, our simulation results suggest that using the coefficient of determination (R²) as a guideline for increasing or reducing the number of SNPs in the regression model can yield better prediction quality than the standard approach. However, important parameters such as trait heritability and the approximate number of QTLs have to be determined from previous studies or estimated accurately.
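The comparison described in the abstract can be illustrated with a small simulation. The following Python sketch is not the authors' code: the sample sizes, number of SNPs, minor allele frequency, effect-size distribution, Bonferroni threshold, normal approximation to the P-values, and the rule of picking the SNP count whose training R² is closest to the assumed heritability are all assumptions made for illustration.

```python
import math
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical simulation settings (not taken from the paper).
n_train, n_test, m, n_qtl = 1000, 1000, 2000, 20
h2, maf = 0.5, 0.3

def genotypes(n):
    # Independent SNPs coded as minor-allele counts 0/1/2 (assumption: no LD).
    return rng.binomial(2, maf, size=(n, m)).astype(float)

# Pick the causal SNPs and scale the noise so the trait has heritability h2.
qtl_idx = rng.choice(m, size=n_qtl, replace=False)
effects = rng.normal(0.0, 1.0, size=n_qtl)
var_g = np.sum(effects ** 2) * 2 * maf * (1 - maf)
noise_sd = math.sqrt(var_g * (1 - h2) / h2)

def trait(G):
    return G[:, qtl_idx] @ effects + rng.normal(0.0, noise_sd, size=G.shape[0])

def marginal_pvalues(G, y):
    # Single-SNP regression t-statistics from the Pearson correlation;
    # two-sided P-values use a normal approximation (assumption, large n).
    n = G.shape[0]
    Gc, yc = G - G.mean(axis=0), y - y.mean()
    r = (Gc.T @ yc) / np.sqrt((Gc ** 2).sum(axis=0) * (yc ** 2).sum())
    t = r * np.sqrt((n - 2) / (1 - r ** 2))
    return np.array([math.erfc(abs(v) / math.sqrt(2)) for v in t])

def fit_and_score(G_tr, y_tr, G_te, y_te, cols):
    # Ordinary least squares on the chosen SNPs plus an intercept.
    X_tr = np.column_stack([np.ones(len(y_tr)), G_tr[:, cols]])
    X_te = np.column_stack([np.ones(len(y_te)), G_te[:, cols]])
    coef, *_ = np.linalg.lstsq(X_tr, y_tr, rcond=None)
    r2 = 1 - np.var(y_tr - X_tr @ coef) / np.var(y_tr)   # training R^2
    mspe = np.mean((y_te - X_te @ coef) ** 2)             # test-set MSPE
    return r2, mspe

G_tr, G_te = genotypes(n_train), genotypes(n_test)
y_tr, y_te = trait(G_tr), trait(G_te)

pvals = marginal_pvalues(G_tr, y_tr)
order = np.argsort(pvals)                  # SNPs ranked by P-value

# Standard approach: only SNPs passing a Bonferroni threshold (assumption).
significant = np.flatnonzero(pvals < 0.05 / m)
standard = fit_and_score(G_tr, y_tr, G_te, y_te, significant)

# Alternative: top-k SNPs by P-value for a grid of k values.
k_grid = [1, 5, 10, 20, 50, 100]
results = {k: fit_and_score(G_tr, y_tr, G_te, y_te, order[:k]) for k in k_grid}

# Illustrative R^2 guideline: choose k whose training R^2 is closest to h2.
k_chosen = min(k_grid, key=lambda k: abs(results[k][0] - h2))

print("standard: n_snps=%d R2=%.3f MSPE=%.3f" % (len(significant), *standard))
for k in k_grid:
    print("top-%d: R2=%.3f MSPE=%.3f" % (k, *results[k]))
print("k chosen by R2 guideline:", k_chosen)
```

Because the top-k SNP sets are nested, the training R² can only grow with k, so it cannot choose the model size on its own. This is why the sketch needs an external heritability value, in line with the abstract's caveat that such parameters must come from previous studies or be estimated accurately.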

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Chromosome Mapping / methods
  • Computer Simulation
  • Genetic Markers
  • Genome-Wide Association Study
  • Genotype
  • Humans
  • Inheritance Patterns
  • Models, Genetic*
  • Phenotype
  • Polymorphism, Single Nucleotide*
  • Quantitative Trait Loci*
  • Regression Analysis

Substances

  • Genetic Markers