Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics

PLoS One. 2013 Sep 30;8(9):e76809. doi: 10.1371/journal.pone.0076809. eCollection 2013.

Abstract

The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

Publication types

  • Research Support, Non-U.S. Gov't
  • Retracted Publication

MeSH terms

  • Amino Acid Motifs / genetics
  • Arabidopsis / genetics
  • Chromosome Mapping
  • Cluster Analysis
  • Computational Biology
  • Conserved Sequence / genetics
  • Gene Expression Profiling
  • Genome, Plant / genetics*
  • Genomics / methods
  • Glycine max / genetics*
  • Multigene Family / genetics*
  • Oryza / genetics
  • Phylogeny*
  • Sequence Alignment
  • Species Specificity
  • Transcription Factors / genetics*
  • Zinc Fingers / genetics*

Substances

  • Transcription Factors

Grants and funding

This work was supported by the National Natural Science Foundation of China (31071446 and 31271753), the Fundamental Research Funds for ICS-CAAS (Grant to Y. G.), the State High-tech Research and Development Program (2013AA102602) and the National Transgenic Major Program (2013ZX08004-001 and 2013ZX08004-002). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.