Gene content and function of the ancestral chromosome fusion site in human chromosome 2q13-2q14.1 and paralogous regions

Genome Res. 2002 Nov;12(11):1663-72. doi: 10.1101/gr.338402.

Abstract

Various portions of the region surrounding the site where two ancestral chromosomes fused to form human chromosome 2 are duplicated elsewhere in the human genome, primarily in subtelomeric and pericentromeric locations. At least 24 potentially functional genes and 16 pseudogenes reside in the 614-kb of sequence surrounding the fusion site and paralogous segments on other chromosomes. By comparing the sequences of genomic copies and transcripts, we show that at least 18 of the genes in these paralogous regions are transcriptionally active. Among these genes are new members of the cobalamin synthetase W domain (CBWD) and forkhead domain FOXD4 gene families. Copies of RPL23A and SNRPA1 on chromosome 2 are retrotransposed-processed pseudogenes that were included in segmental duplications; we find 53 RPL23A pseudogenes in the human genome and map the functional copy of SNRPA1 to 15qter. The draft sequence of the human genome also provides new information on the location and intron-exon structure of functional copies of other 2q-fusion genes (PGM5, retina-specific F379, helicase CHLR1, and acrosin). This study illustrates that the duplication and rearrangement of subtelomeric and pericentromeric regions have functional relevance to human biology; these processes can change gene dosage and/or generate genes with new functions.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence / genetics
  • Base Sequence / genetics
  • Centromere / genetics
  • Chromosomes, Human, Pair 2 / chemistry*
  • Chromosomes, Human, Pair 2 / physiology*
  • Cytoskeletal Proteins / genetics
  • DNA-Binding Proteins / genetics
  • Evolution, Molecular*
  • Forkhead Transcription Factors
  • Gene Duplication
  • Genes / genetics*
  • Humans
  • Molecular Sequence Data
  • Multigene Family / genetics
  • Nitrogenous Group Transferases / genetics
  • Organ Specificity / genetics
  • Phosphoglucomutase*
  • Protein Structure, Tertiary / genetics
  • Protein Structure, Tertiary / physiology
  • Pseudogenes / genetics
  • Retina / chemistry
  • Retina / metabolism
  • Ribonucleoproteins, Small Nuclear / genetics
  • Ribosomal Proteins / genetics
  • Sequence Homology, Nucleic Acid*
  • Trans-Activators / genetics
  • Translocation, Genetic / genetics*
  • Translocation, Genetic / physiology*

Substances

  • Cytoskeletal Proteins
  • DNA-Binding Proteins
  • FOXD4 protein, human
  • FOXD4L1 protein, human
  • Forkhead Transcription Factors
  • PGM5 protein, human
  • RPL23a protein, human
  • Ribonucleoproteins, Small Nuclear
  • Ribosomal Proteins
  • Trans-Activators
  • Nitrogenous Group Transferases
  • ZNG1B protein, human
  • Phosphoglucomutase

Associated data

  • GENBANK/AF452722
  • GENBANK/AF452723
  • GENBANK/AF452724