Curating the gnomAD database: Report of novel variants in the thyrogobulin gene using in silico bioinformatics algorithms

Mol Cell Endocrinol. 2021 Aug 20:534:111359. doi: 10.1016/j.mce.2021.111359. Epub 2021 Jun 11.

Abstract

Thyroglobulin (TG) is a large glycosylated protein of 2767 amino acids, secreted by the thyrocytes into the follicular lumen. It plays an essential role in the process of thyroid hormone synthesis. TG gene variants lead to permanent congenital hypothyroidism. In the present work, we report a detailed population and bioinformatic prediction analyses of the TG variants indexed in the Genome Aggregation Database (gnomAD). The results showed a clear predominance of nonsense variants in the European (Finnish), European (Non-Finnish) and Ashkenazi Jewish ethnic groups, whereas the splice site variants predominate in South Asian and African/African-American populations. In total, 282 novel TG variants were described (47 missense involving the wild-type cysteine residues, 177 missense located in the ChEL domain and 58 splice site variants) which were not reported in the literature and that would have deleterious effects in prediction programs. In the gnomAD population, the estimated prevalence of heterozygous carriers of the potentially damaging variants was 1:320. In conclusion, we provide an updated and curated reference source for the diagnosis of thyroid disease, mainly to congenital hypothyroidism due to TG deficiency. The identification and characterization of TG variants is undoubtedly a valuable approach to study the TG structure/function relations and an important tool for clinical diagnosis and genetic counseling.

Keywords: Bioinformatic predictor tools; Loss-of-function variants; Missense variants; Thyroglobulin gene; Thyroid diseases.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alternative Splicing
  • Codon, Nonsense
  • Computational Biology / methods*
  • Congenital Hypothyroidism / genetics*
  • Data Curation
  • Databases, Genetic
  • Ethnicity / genetics*
  • Genetic Variation*
  • Humans
  • Mutation, Missense
  • Protein Domains
  • Thyroglobulin / chemistry
  • Thyroglobulin / genetics*

Substances

  • Codon, Nonsense
  • TG protein, human
  • Thyroglobulin