Next-generation sequencing reveals differentially amplified tandem repeats as a major genome component of Northern Europe's oldest Camellia japonica

Chromosome Res. 2015 Dec;23(4):791-806. doi: 10.1007/s10577-015-9500-x. Epub 2015 Nov 18.

Abstract

Northern Europe's oldest and largest Camellia japonica growing at the Pillnitz Castle (Germany) for over 200 years is of botanical and cultural importance and is a reference for C. japonica molecular scale analysis. In order to provide a fundament for genome analysis of the genus Camellia, we characterize the C. japonica tandem repeat fraction, constituting 12.5 % of the Pillnitz camellia's genome. A genomic library of the Pillnitz C. japonica was produced and Illumina sequenced to generate 36 Gb of paired-end reads. We performed graph-based read clustering implemented in the RepeatExplorer pipeline to estimate the C. japonica repeat fraction of 73 %. This enabled us to identify and characterize the most prominent satellite DNAs, Camellia japonica satellite 1-4 (CajaSat1-CajaSat4), and the 5S ribosomal DNA (rDNA) by bioinformatics, fluorescent in situ and Southern hybridization. Within the Camellia genus, satellite spreading, array expansion and formation of higher-order structures highlight different modes of repeat evolution. The CajaSat satellites localize at prominent chromosomal sites, including (peri)centromeres and subtelomeres of all chromosomes, thus serving as chromosomal landmarks for their identification. This work provides an insight into the C. japonica chromosome organization and significantly expands the Camellia genomic knowledge, also with respect to the tea plant Camellia sinensis.

Keywords: Camellia japonica; Fluorescent in situ hybridization; RepeatExplorer; Satellite DNA; Tandem repeats.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Camellia / genetics*
  • Chromosomes, Plant
  • Consensus Sequence
  • DNA Methylation
  • DNA, Satellite
  • Genome Components*
  • Genome, Plant*
  • High-Throughput Nucleotide Sequencing
  • Molecular Sequence Data
  • Sequence Alignment
  • Tandem Repeat Sequences*

Substances

  • DNA, Satellite