What can we learn when fitting a simple telegraph model to a complex gene expression model?

PLoS Comput Biol. 2024 May 14;20(5):e1012118. doi: 10.1371/journal.pcbi.1012118. eCollection 2024 May.

Abstract

In experiments, the distributions of mRNA or protein numbers in single cells are often fitted to the random telegraph model which includes synthesis and decay of mRNA or protein, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by crucial biological mechanisms such as feedback regulation, non-exponential gene inactivation durations, and multiple gene activation pathways. Here we investigate the dynamical properties of four relatively complex gene expression models by fitting their steady-state mRNA or protein number distributions to the simple telegraph model. We show that despite the underlying complex biological mechanisms, the telegraph model with three effective parameters can accurately capture the steady-state gene product distributions, as well as the conditional distributions in the active gene state, of the complex models. Some effective parameters are reliable and can reflect realistic dynamic behaviors of the complex models, while others may deviate significantly from their real values in the complex models. The effective parameters can also be applied to characterize the capability for a complex model to exhibit multimodality. Using additional information such as single-cell data at multiple time points, we provide an effective method of distinguishing the complex models from the telegraph model. Furthermore, using measurements under varying experimental conditions, we show that fitting the mRNA or protein number distributions to the telegraph model may even reveal the underlying gene regulation mechanisms of the complex models. The effectiveness of these methods is confirmed by analysis of single-cell data for E. coli and mammalian cells. All these results are robust with respect to cooperative transcriptional regulation and extrinsic noise. In particular, we find that faster relaxation speed to the steady state results in more precise parameter inference under large extrinsic noise.

MeSH terms

  • Animals
  • Escherichia coli / genetics
  • Gene Expression*
  • Gene Regulatory Networks
  • Mice
  • Models, Genetic*
  • Proteins / analysis
  • Proteins / genetics
  • RNA, Messenger / analysis
  • RNA, Messenger / genetics
  • Single-Cell Gene Expression Analysis

Substances

  • RNA, Messenger
  • Proteins

Grants and funding

F. J. acknowledges support from National Natural Science Foundation of China with grant No. 12271118. L. B. acknowledges funding from the U.S. National Science Foundation (NSF) grant 2029121, a Cecil H. and Ida Green Endowment, and the University of Texas at Dallas. C. J. acknowledges support from National Natural Science Foundation of China with NSAF grant No. U2230402 and grant No. 12271020. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.