Integrative classification of human coding and noncoding genes through RNA metabolism profiles

Nat Struct Mol Biol. 2017 Jan;24(1):86-96. doi: 10.1038/nsmb.3325. Epub 2016 Nov 21.

Abstract

Pervasive transcription of the human genome results in a heterogeneous mix of coding RNAs and long noncoding RNAs (lncRNAs). Only a small fraction of lncRNAs have demonstrated regulatory functions, thus making functional lncRNAs difficult to distinguish from nonfunctional transcriptional byproducts. This difficulty has resulted in numerous competing human lncRNA classifications that are complicated by a steady increase in the number of annotated lncRNAs. To address these challenges, we quantitatively examined transcription, splicing, degradation, localization and translation for coding and noncoding human genes. We observed that annotated lncRNAs had lower synthesis and higher degradation rates than mRNAs and discovered mechanistic differences explaining slower lncRNA splicing. We grouped genes into classes with similar RNA metabolism profiles, containing both mRNAs and lncRNAs to varying extents. These classes exhibited distinct RNA metabolism, different evolutionary patterns and differential sensitivity to cellular RNA-regulatory pathways. Our classification provides an alternative to genomic context-driven annotations of lncRNAs.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression
  • HEK293 Cells
  • Humans
  • Molecular Sequence Annotation
  • Open Reading Frames
  • RNA Splicing
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism
  • RNA, Untranslated / genetics*
  • RNA, Untranslated / metabolism

Substances

  • RNA, Messenger
  • RNA, Untranslated