Combinatorial DNA Rearrangement Facilitates the Origin of New Genes in Ciliates

Genome Biol Evol. 2015 Sep 2;7(10):2859-70. doi: 10.1093/gbe/evv172.

Abstract

Programmed genome rearrangements in the unicellular eukaryote Oxytricha trifallax produce a transcriptionally active somatic nucleus from a copy of its germline nucleus during development. This process eliminates noncoding sequences that interrupt coding regions in the germline genome, and joins over 225,000 remaining DNA segments, some of which require inversion or complex permutation to build functional genes. This dynamic genomic organization permits some single DNA segments in the germline to contribute to multiple, distinct somatic genes via alternative processing. Like alternative mRNA splicing, the combinatorial assembly of DNA segments contributes to genetic variation and facilitates the evolution of new genes. In this study, we use comparative genomic analysis to demonstrate that the emergence of alternative DNA splicing is associated with the origin of new genes. Short duplications give rise to alternative gene segments that are spliced to the shared gene segments. Alternative gene segments evolve faster than shared, constitutive segments. Genes with shared segments frequently have different expression profiles, permitting functional divergence. This study reports alternative DNA splicing as a mechanism of new gene origination, illustrating how the process of programmed genome rearrangement gives rise to evolutionary innovation.

Keywords: alternative splicing; comparative genomics; gene duplication; genome rearrangement; novel genes.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Alternative Splicing
  • Base Sequence
  • Cell Nucleus / genetics
  • Comparative Genomic Hybridization
  • DNA, Protozoan / genetics*
  • Evolution, Molecular
  • Gene Duplication
  • Gene Expression Regulation
  • Gene Rearrangement*
  • Genes, Protozoan
  • Germ Cells / growth & development
  • Germ Cells / physiology
  • Molecular Sequence Data
  • Oxytricha / genetics*
  • Phylogeny
  • Sequence Analysis, DNA
  • Sequence Inversion
  • Transcriptome

Substances

  • DNA, Protozoan

Associated data

  • GENBANK/ADNZ03000000
  • GENBANK/LAEC00000000
  • GENBANK/LASQ02000000
  • GENBANK/LASR02000000
  • GENBANK/LASS02000000
  • GENBANK/LAST02000000
  • GENBANK/LASU02000000