Genome-wide characterization of simple sequence repeats in Palmae genomes

Genes Genomics. 2020 May;42(5):597-608. doi: 10.1007/s13258-020-00924-w. Epub 2020 Apr 3.

Abstract

Background: Microsatellites or simple sequence repeats (SSRs) have become the most significant DNA marker technology used in genetic research. The availability of complete draft genomes for a number of Palmae species has made it possible to perform genome-wide analysis of SSRs in these species. Palm trees are tropical and subtropical plants with agricultural and economic importance due to the nutritional value of their fruit cultivars.

Objective: This is the first comprehensive study examining and comparing microsatellites in completely-sequenced draft genomes of Palmae species.

Methods: We identified and compared perfect SSRs with 1-6 bp nucleotide motifs to characterize microsatellites in Palmae species using PERF v0.2.5. We analyzed their relative abundance, relative density, and GC content in five palm species: Phoenix dactylifera, Cocos nucifera, Calamus simplicifolius, Elaeis oleifera, and Elaeis guineensis.

Results: A total of 118241, 328189, 450753, 176608, and 70694 SSRs were identified, respectively. The six repeat types were not evenly distributed across the five genomes. Mono- and dinucleotide SSRs were the most abundant, and GC content was highest in tri- and hexanucleotide SSRs.

Conclusion: We envisage that this analysis would further substantiate more in-depth computational, biochemical, and molecular studies on the roles SSRs may play in the genome organization of the palm species. The current study contributes a detailed characterization of simple sequence repeats in palm genomes.

Keywords: Arecaceae; Microsatellite; Molecular marker; Palmae family; SSR abundance.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arecaceae / classification
  • Arecaceae / genetics*
  • Genome, Plant
  • Microsatellite Repeats*
  • Phylogeny