Large-Scale Evolutionary Analysis of Genes and Supergene Clusters from Terpenoid Modular Pathways Provides Insights into Metabolic Diversification in Flowering Plants

PLoS One. 2015 Jun 5;10(6):e0128808. doi: 10.1371/journal.pone.0128808. eCollection 2015.

Abstract

An important component of plant evolution is the plethora of pathways producing more than 200,000 biochemically diverse specialized metabolites with pharmacological, nutritional and ecological significance. To unravel dynamics underlying metabolic diversification, it is critical to determine lineage-specific gene family expansion in a phylogenomics framework. However, robust functional annotation is often only available for core enzymes catalyzing committed reaction steps within few model systems. In a genome informatics approach, we extracted information from early-draft gene-space assemblies and non-redundant transcriptomes to identify protein families involved in isoprenoid biosynthesis. Isoprenoids comprise terpenoids with various roles in plant-environment interaction, such as pollinator attraction or pathogen defense. Combining lines of evidence provided by synteny, sequence homology and Hidden-Markov-Modelling, we screened 17 genomes including 12 major crops and found evidence for 1,904 proteins associated with terpenoid biosynthesis. Our terpenoid genes set contains evidence for 840 core terpene-synthases and 338 triterpene-specific synthases. We further identified 190 prenyltransferases, 39 isopentenyl-diphosphate isomerases as well as 278 and 219 proteins involved in mevalonate and methylerithrol pathways, respectively. Assessing the impact of gene and genome duplication to lineage-specific terpenoid pathway expansion, we illustrated key events underlying terpenoid metabolic diversification within 250 million years of flowering plant radiation. By quantifying Angiosperm-wide versatility and phylogenetic relationships of pleiotropic gene families in terpenoid modular pathways, our analysis offers significant insight into evolutionary dynamics underlying diversification of plant secondary metabolism. Furthermore, our data provide a blueprint for future efforts to identify and more rapidly clone terpenoid biosynthetic genes from any plant species.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alkyl and Aryl Transferases / genetics
  • Alkyl and Aryl Transferases / metabolism
  • Biological Evolution
  • Carbon-Carbon Double Bond Isomerases / genetics
  • Carbon-Carbon Double Bond Isomerases / metabolism
  • Dimethylallyltranstransferase / genetics
  • Dimethylallyltranstransferase / metabolism
  • Genome, Plant*
  • Hemiterpenes
  • Isoenzymes / genetics
  • Isoenzymes / metabolism
  • Magnoliopsida / classification
  • Magnoliopsida / genetics*
  • Magnoliopsida / metabolism
  • Metabolic Networks and Pathways / genetics
  • Metabolomics
  • Mevalonic Acid / metabolism
  • Molecular Sequence Annotation
  • Multigene Family*
  • Phylogeny*
  • Plant Proteins / genetics*
  • Plant Proteins / metabolism
  • Terpenes / metabolism*

Substances

  • Hemiterpenes
  • Isoenzymes
  • Plant Proteins
  • Terpenes
  • Alkyl and Aryl Transferases
  • terpene synthase
  • Dimethylallyltranstransferase
  • Carbon-Carbon Double Bond Isomerases
  • isopentenyldiphosphate delta-isomerase
  • Mevalonic Acid

Grants and funding

This work was funded by a Netherlands Organization for Scientific Research (NWO) Ecogenomics grant (M.E.S.).