A Comprehensive Curation Shows the Dynamic Evolutionary Patterns of Prokaryotic CRISPRs

Biomed Res Int. 2016:2016:7237053. doi: 10.1155/2016/7237053. Epub 2016 Apr 18.

Abstract

Motivation. Clustered regularly interspaced short palindromic repeat (CRISPR) is a genetic element with active regulation roles for foreign invasive genes in the prokaryotic genomes and has been engineered to work with the CRISPR-associated sequence (Cas) gene Cas9 as one of the modern genome editing technologies. Due to inconsistent definitions, the existing CRISPR detection programs seem to have missed some weak CRISPR signals. Results. This study manually curates all the currently annotated CRISPR elements in the prokaryotic genomes and proposes 95 updates to the annotations. A new definition is proposed to cover all the CRISPRs. The comprehensive comparison of CRISPR numbers on the taxonomic levels of both domains and genus shows high variations for closely related species even in the same genus. The detailed investigation of how CRISPRs are evolutionarily manipulated in the 8 completely sequenced species in the genus Thermoanaerobacter demonstrates that transposons act as a frequent tool for splitting long CRISPRs into shorter ones along a long evolutionary history.

MeSH terms

  • CRISPR-Cas Systems / genetics*
  • DNA, Intergenic / genetics
  • Data Curation / methods*
  • Databases, Nucleic Acid
  • Evolution, Molecular*
  • Genome, Bacterial
  • Prokaryotic Cells / metabolism*
  • Repetitive Sequences, Nucleic Acid / genetics

Substances

  • DNA, Intergenic