Identification of human haploinsufficient genes and their genomic proximity to segmental duplications

Eur J Hum Genet. 2008 Nov;16(11):1350-7. doi: 10.1038/ejhg.2008.111. Epub 2008 Jun 4.

Abstract

Despite the significance of haploinsufficiency in human disease, no systematic study has been reported into the types of genes that are haploinsufficient in human, or into the mechanisms that commonly lead to their deletion and to the expression of the haploinsufficient phenotype. We have applied a rigorous text-searching and database-mining strategy to extract, as comprehensively as possible, from PubMed and OMIM an annotated list of currently known human haploinsufficient genes, including their functions and associated diseases. Gene-set enrichment analysis shows that genes-encoding transcription factors, and genes that function in development, the cell cycle, and nucleic acid metabolism are overrepresented among haploinsufficient genes in human. Many of the phenotypes associated with loss-of-function or deletion of one copy of a haploinsufficient gene describe mental retardation, developmental or metabolic disorders, or tumourigenesis. We also found that haploinsufficient genes are less likely than the complete set of human genes to be situated between pairs of segmental duplications (SDs) that are in close proximity to each other on the same chromosome. Given that SDs can initiate non-allelic homologous recombination (NAHR) and the deletion of adjacent genomic regions, this suggests that the location of haploinsufficient genes between SD pairs, from whence they may suffer intra-genomic rearrangement and loss, is selectively disadvantageous. We describe a custom-made Java visualization tool, HaploGeneMapper, to aid in visualizing the proximity of human haploinsufficient genes to SDs and to enable identification of haploinsufficient genes that are vulnerable to NAHR-mediated deletion.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Chromosomes, Human / genetics*
  • Gene Duplication*
  • Genetic Diseases, Inborn / genetics*
  • Genome, Human / genetics*
  • Humans
  • PubMed
  • Software