Nucleomorph genome of Hemiselmis andersenii reveals complete intron loss and compaction as a driver of protein structure and function

Proc Natl Acad Sci U S A. 2007 Dec 11;104(50):19908-13. doi: 10.1073/pnas.0707419104. Epub 2007 Dec 6.

Abstract

Nucleomorphs are the remnant nuclei of algal endosymbionts that took up residence inside a nonphotosynthetic eukaryotic host. The nucleomorphs of cryptophytes and chlorarachniophytes are derived from red and green algal endosymbionts, respectively, and represent a stunning example of convergent evolution: their genomes have independently been reduced and compacted to <1 megabase pairs (Mbp) in size (the smallest nuclear genomes known) and to a similar three-chromosome architecture. The molecular processes underlying genome reduction and compaction in eukaryotes are largely unknown, as is the impact of reduction/compaction on protein structure and function. Here, we present the complete 0.572-Mbp nucleomorph genome of the cryptophyte Hemiselmis andersenii and show that it is completely devoid of spliceosomal introns and genes for splicing RNAs-a case of complete intron loss in a nuclear genome. Comparison of H. andersenii proteins to those encoded in the slightly smaller (0.551-Mbp) nucleomorph genome of another cryptophyte, Guillardia theta, and to their homologs in the unicellular red alga Cyanidioschyzon merolae reveal that (i) cryptophyte nucleomorph genomes encode proteins that are significantly smaller than those in their free-living algal ancestors, and (ii) the smaller, more compact G. theta nucleomorph genome encodes significantly smaller proteins than that of H. andersenii. These results indicate that genome compaction can eliminate both coding and noncoding DNA and, consequently, drive the evolution of protein structure and function. Nucleomorph proteins have the potential to reveal the minimal functional units required for basic eukaryotic cellular processes.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algal Proteins / chemistry*
  • Algal Proteins / genetics*
  • Algal Proteins / physiology
  • Cell Nucleus / genetics*
  • Cryptophyta / chemistry*
  • Cryptophyta / genetics*
  • Evolution, Molecular*
  • Genome*
  • Introns / genetics*
  • Molecular Sequence Data
  • Structure-Activity Relationship
  • Symbiosis

Substances

  • Algal Proteins

Associated data

  • GENBANK/CP000881
  • GENBANK/CP000882
  • GENBANK/CP000883