Functional and structural divergence of an unusual LTR retrotransposon family in plants

PLoS One. 2012;7(10):e48595. doi: 10.1371/journal.pone.0048595. Epub 2012 Oct 31.

Abstract

Retrotransposons with long terminal repeats (LTRs) more than 3 kb are not frequent in most eukaryotic genomes. Rice LTR retrotransposon, Retrosat2, has LTRs greater than 3.2 kb and two open reading frames (ORF): ORF1 encodes enzymes for retrotransposition whereas no function can be assigned to ORF0 as it is not found in any other organism. A variety of experimental and in silico approaches were used to determine the origin of Retrosat2 and putative function of ORF0. Our data show that not only is Retrosat2 highly abundant in the Oryza genus, it may yet be active in rice. Homologs of Retrosat2 were identified in maize, sorghum, Arabidopsis and other plant genomes suggesting that the Retrosat2 family is of ancient origin. Several putatively cis-acting elements, some multicopy, that regulate retrotransposon replication or responsiveness to environmental factors were found in the LTRs of Retrosat2. Unlike the ORF1, the ORF0 sequences from Retrosat2 and homologs are divergent at the sequence level, 3D-structures and predicted biological functions. In contrast to other retrotransposon families, Retrosat2 and its homologs are dispersed throughout genomes and not concentrated in the specific chromosomal regions, such as centromeres. The genomic distribution of Retrosat2 homologs varies across species which likely reflects the differing evolutionary trajectories of this retrotransposon family across diverse species.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Base Sequence
  • Binding Sites / genetics
  • Chromosomes, Plant / genetics
  • DNA, Plant / genetics
  • Genetic Variation*
  • Genome, Plant / genetics
  • In Situ Hybridization, Fluorescence
  • Models, Molecular
  • Open Reading Frames / genetics
  • Oryza / classification
  • Oryza / genetics
  • Phylogeny
  • Plant Proteins / chemistry
  • Plant Proteins / classification
  • Plant Proteins / genetics
  • Plants / classification
  • Plants / genetics*
  • Protein Conformation
  • Protein Structure, Tertiary
  • Retroelements / genetics*
  • Species Specificity
  • Terminal Repeat Sequences / genetics*

Substances

  • DNA, Plant
  • Plant Proteins
  • Retroelements

Grants and funding

The National Science Foundation 0424833, Division of Biological Infrastructure (DBI) 0603927, Integrated and Organismal Biology (IOS) 0701382 and Molecular and Cellular Biosciences (MCB) 1026200 provided funding. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.