Synthetic enhancer design by in silico compensatory evolution reveals flexibility and constraint in cis-regulation

BMC Syst Biol. 2017 Nov 29;11(1):116. doi: 10.1186/s12918-017-0485-2.

Abstract

Background: Models that incorporate specific chemical mechanisms have been successful in describing the activity of Drosophila developmental enhancers as a function of underlying transcription factor binding motifs. Despite this, the minimum set of mechanisms required to reconstruct an enhancer from its constituent parts is not known. Synthetic biology offers the potential to test the sufficiency of known mechanisms to describe the activity of enhancers, as well as to uncover constraints on the number, order, and spacing of motifs.

Results: Using a functional model and in silico compensatory evolution, we generated putative synthetic even-skipped stripe 2 enhancers with varying degrees of similarity to the natural enhancer. These elements represent the evolutionary trajectories of the natural stripe 2 enhancer towards two synthetic enhancers designed ab initio. In the first trajectory, spatially regulated expression was maintained, even after more than a third of binding sites were lost. In the second, sequences with high similarity to the natural element did not drive expression, but a highly diverged sequence about half the length of the minimal stripe 2 enhancer drove ten times greater expression. Additionally, homotypic clusters of Zelda or Stat92E motifs, but not Bicoid, drove expression in developing embryos.

Conclusions: Here, we present a functional model of gene regulation to test the degree to which the known transcription factors and their interactions explain the activity of the Drosophila even-skipped stripe 2 enhancer. Initial success in the first trajectory showed that the gene regulation model explains much of the function of the stripe 2 enhancer. Cases where expression deviated from prediction indicates that undescribed factors likely act to modulate expression. We also showed that activation driven Bicoid and Hunchback is highly sensitive to spatial organization of binding motifs. In contrast, Zelda and Stat92E drive expression from simple homotypic clusters, suggesting that activation driven by these factors is less constrained. Collectively, the 40 sequences generated in this work provides a powerful training set for building future models of gene regulation.

Keywords: Bicoid; Cis-regulatory logic; Dicheate; Even-skipped regulation; Gene regulatory models; Hunchback; Stat92E; Synthetic enhancers; Transriptional control; Zelda.

MeSH terms

  • Animals
  • Binding Sites
  • Computer Simulation
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism
  • Drosophila Proteins / genetics
  • Drosophila Proteins / metabolism
  • Drosophila melanogaster / genetics*
  • Drosophila melanogaster / growth & development
  • Embryo, Nonmammalian / cytology
  • Embryo, Nonmammalian / metabolism
  • Enhancer Elements, Genetic*
  • Evolution, Molecular*
  • Gene Expression Regulation, Developmental*
  • Homeodomain Proteins / genetics
  • Homeodomain Proteins / metabolism
  • Nuclear Proteins
  • STAT Transcription Factors / genetics
  • STAT Transcription Factors / metabolism
  • Trans-Activators / genetics
  • Trans-Activators / metabolism
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • DNA-Binding Proteins
  • Drosophila Proteins
  • Homeodomain Proteins
  • Nuclear Proteins
  • STAT Transcription Factors
  • Stat92E protein, Drosophila
  • Trans-Activators
  • Transcription Factors
  • bcd protein, Drosophila
  • eve protein, Drosophila
  • hb protein, Drosophila
  • zld protein, Drosophila