Genetic determinants of polygenic prediction accuracy within a population

Tianyuan Lu; Vincenzo Forgetta; John Brent Richards; Celia M T Greenwood

doi:10.1093/genetics/iyac158

Genetic determinants of polygenic prediction accuracy within a population

Genetics. 2022 Nov 30;222(4):iyac158. doi: 10.1093/genetics/iyac158.

Authors

Tianyuan Lu^{1

2}, Vincenzo Forgetta¹, John Brent Richards^{1

3

4}, Celia M T Greenwood^{1

3

5

6}

Affiliations

¹ Centre for Clinical Epidemiology, Lady Davis Institute for Medical Research, Jewish General Hospital, Montreal, QC H3T 1E2, Canada.
² Quantitative Life Sciences Program, McGill University, Montreal, QC H3A 0G4, Canada.
³ Department of Human Genetics, McGill University, Montreal, QC H3A 0G4, Canada.
⁴ Department of Twin Research and Genetic Epidemiology, King's College London, London WC2R 2LS, UK.
⁵ Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC H3A 0G4, Canada.
⁶ Gerald Bronfman Department of Oncology, McGill University, Montreal, QC H3A 0G4, Canada.

Abstract

Genomic risk prediction is on the emerging path toward personalized medicine. However, the accuracy of polygenic prediction varies strongly in different individuals. Based on up to 352,277 European ancestry participants in the UK Biobank, we constructed polygenic risk scores for 15 physiological and biochemical quantitative traits. We identified a total of 185 polygenic prediction variability quantitative trait loci for 11 traits by Levene's test among 254,376 unrelated individuals. We validated the effects of prediction variability quantitative trait loci using an independent test set of 58,927 individuals. For instance, a score aggregating 51 prediction variability quantitative trait locus variants for triglycerides had the strongest Spearman correlation of 0.185 (P-value <1.0 × 10-300) with the squared prediction errors. We found a strong enrichment of complex genetic effects conferred by prediction variability quantitative trait loci compared to risk loci identified in genome-wide association studies, including 89 prediction variability quantitative trait loci exhibiting dominance effects. Incorporation of dominance effects into polygenic risk scores significantly improved polygenic prediction for triglycerides, low-density lipoprotein cholesterol, vitamin D, and platelet. In conclusion, we have discovered and profiled genetic determinants of polygenic prediction variability for 11 quantitative biomarkers. These findings may assist interpretation of genomic risk prediction in various contexts and encourage novel approaches for constructing polygenic risk scores with complex genetic effects.

Keywords: GenPred; Genomic Prediction; Shared Data Resource; dominance; gene-by-environment interaction; genome-wide association study; polygenic risk score; prediction accuracy; quantitative trait loci.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Genetic Predisposition to Disease
Genome-Wide Association Study*
Humans
Multifactorial Inheritance
Polymorphism, Single Nucleotide*
Quantitative Trait Loci
Triglycerides

Genetic determinants of polygenic prediction accuracy within a population

Authors

Affiliations

Abstract

Publication types

MeSH terms

Substances

Associated data

Grants and funding