A computational approach for detecting physiological homogeneity in the midst of genetic heterogeneity

Am J Hum Genet. 2021 Jun 3;108(6):1012-1025. doi: 10.1016/j.ajhg.2021.04.023. Epub 2021 May 19.

Abstract

The human genetic dissection of clinical phenotypes is complicated by genetic heterogeneity. Gene burden approaches that detect genetic signals in case-control studies are underpowered in genetically heterogeneous cohorts. We therefore developed a genome-wide computational method, network-based heterogeneity clustering (NHC), to detect physiological homogeneity in the midst of genetic heterogeneity. Simulation studies showed our method to be capable of systematically converging genes in biological proximity on the background biological interaction network, and capturing gene clusters harboring presumably deleterious variants, in an efficient and unbiased manner. We applied NHC to whole-exome sequencing data from a cohort of 122 individuals with herpes simplex encephalitis (HSE), including 13 individuals with previously published monogenic inborn errors of TLR3-dependent IFN-α/β immunity. The top gene cluster identified by our approach successfully detected and prioritized all causal variants of five TLR3 pathway genes in the 13 previously reported individuals. This approach also suggested candidate variants of three reported genes and four candidate genes from the same pathway in another ten previously unstudied individuals. TLR3 responsiveness was impaired in dermal fibroblasts from four of the five individuals tested, suggesting that the variants detected were causal for HSE. NHC is, therefore, an effective and unbiased approach for unraveling genetic heterogeneity by detecting physiological homogeneity.

Keywords: cohort analysis; gene clustering; genetic heterogeneity; herpes simplex encephalitis; incomplete penetrance; network biology; next-generation sequencing; physiological homogeneity; software.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Case-Control Studies
  • Computational Biology / methods*
  • Encephalitis, Herpes Simplex / genetics*
  • Encephalitis, Herpes Simplex / immunology
  • Encephalitis, Herpes Simplex / pathology*
  • Exome Sequencing
  • Fibroblasts / immunology*
  • Fibroblasts / metabolism
  • Gene Regulatory Networks*
  • Genetic Heterogeneity*
  • Genetic Predisposition to Disease*
  • Humans
  • Toll-Like Receptor 3 / genetics
  • Toll-Like Receptor 3 / immunology
  • Toll-Like Receptor 3 / metabolism

Substances

  • TLR3 protein, human
  • Toll-Like Receptor 3