Investigation of mutations in the HBB gene using the 1,000 genomes database

PLoS One. 2017 Apr 5;12(4):e0174637. doi: 10.1371/journal.pone.0174637. eCollection 2017.

Abstract

Mutations in the HBB gene are responsible for several serious hemoglobinopathies, such as sickle cell anemia and β-thalassemia. Sickle cell anemia is one of the most common monogenic diseases worldwide. Due to its prevalence, diverse strategies have been developed for a better understanding of its molecular mechanisms. In silico analysis has been increasingly used to investigate the genotype-phenotype relationship of many diseases, and the sequences of healthy individuals deposited in the 1,000 Genomes database appear to be an excellent tool for such analysis. The objective of this study is to analyze the variations in the HBB gene in the 1,000 Genomes database, to describe the mutation frequencies in the different population groups, and to investigate the pattern of pathogenicity. The computational tool SNPEFF was used to align the data from 2,504 samples of the 1,000 Genomes database with the HG19 genome reference. The pathogenicity of each amino acid change was investigated using the databases CLINVAR, dbSNP and HbVar and five different predictors. Twenty different mutations were found in 209 healthy individuals. The African group had the highest number of individuals with mutations, and the European group had the lowest number. Thus, it is concluded that approximately 8.3% of phenotypically healthy individuals from the 1,000 Genomes database have some mutation in the HBB gene. The frequency of mutated genes was estimated at 0.042, so that the expected frequency of being homozygous or compound heterozygous for these variants in the next generation is approximately 0.002. In total, 193 subjects had a non-synonymous mutation, which 186 (7.4%) have a deleterious mutation. Considering that the 1,000 Genomes database is representative of the world's population, it can be estimated that fourteen out of every 10,000 individuals in the world will have a hemoglobinopathy in the next generation.

MeSH terms

  • Alleles
  • Amino Acid Substitution / genetics
  • Black People / genetics
  • Databases, Genetic
  • Genome, Human / genetics*
  • Humans
  • Mutation / genetics*
  • Polymorphism, Single Nucleotide / genetics
  • Sequence Alignment
  • White People / genetics
  • beta-Globins / genetics*

Substances

  • beta-Globins

Grants and funding

This study was supported by Rede de Pesquisa em Genômica Populacional Humana (RPGPH) - 3381/2013 CAPES-BioComputacional, FADESP/PROPESP/UFPA (Universidade Federal do Pará), FAPESPA (Fundacão Amazonia Paraense de Amparo à Pesquisa) ICAAF 083/2013, and CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico). ÂNDREA RIBEIRO-DOS-SANTOS supported by CNPq/Produtividade (CNPQ 304413/2015-1); SIDNEY SANTOS supported by CNPq/Produtividade (CNPq 305258/2013-3). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.