Coparalogy: physical and functional clusterings in the human genome

Biochem Biophys Res Commun. 2001 Oct 26;288(2):362-70. doi: 10.1006/bbrc.2001.5794.

Abstract

Two rounds of large-scale duplications are thought to have occurred in early vertebrate ancestry; this is now known as the "2R hypothesis." They have led to the constitution of subfamilies of paralogous genes. Chromosomal regions that contain present-day paralogs (paralogous regions or paralogons) have been identified in mammals. We show that sets of paralogons (PGs) can be assembled in a tentative "human genome paralogy map" that includes all autosomes and X. A total of 14 PGs, containing more than 1600 genes, were assembled in this paralogy map. Genes that belong to the same PG are coparalogs. We show that identification of coparalogy can be used (i) to broaden data on gene mapping, (ii) to identify physical gene clusters that derive from early cis-duplications, and (iii) to speculate on coevolution and coregulation of genes sharing a common structure or function (functional clusters). Thus, coparalogy analyses should parallel phylogenetic analyses and can help draw hypotheses on gene and genome evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping
  • Databases, Factual
  • Evolution, Molecular*
  • Genome, Human*
  • Humans
  • Multigene Family*