Simultaneous characterization of sense and antisense genomic processes by the double-stranded hidden Markov model

Nucleic Acids Res. 2016 Mar 18;44(5):e44. doi: 10.1093/nar/gkv1184. Epub 2015 Nov 17.

Abstract

Hidden Markov models (HMMs) have been extensively used to dissect the genome into functionally distinct regions using data such as RNA expression or DNA binding measurements. It is a challenge to disentangle processes occurring on complementary strands of the same genomic region. We present the double-stranded HMM (dsHMM), a model for the strand-specific analysis of genomic processes. We applied dsHMM to yeast using strand specific transcription data, nucleosome data, and protein binding data for a set of 11 factors associated with the regulation of transcription.The resulting annotation recovers the mRNA transcription cycle (initiation, elongation, termination) while correctly predicting strand-specificity and directionality of the transcription process. We find that pre-initiation complex formation is an essentially undirected process, giving rise to a large number of bidirectional promoters and to pervasive antisense transcription. Notably, 12% of all transcriptionally active positions showed simultaneous activity on both strands. Furthermore, dsHMM reveals that antisense transcription is specifically suppressed by Nrd1, a yeast termination factor.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA / genetics*
  • DNA / metabolism
  • DNA, Fungal / genetics*
  • DNA, Fungal / metabolism
  • Gene Expression Regulation, Fungal*
  • Genome, Fungal*
  • Genomics
  • Markov Chains*
  • Nucleosomes / chemistry
  • Nucleosomes / metabolism
  • Promoter Regions, Genetic
  • Protein Binding
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • RNA-Binding Proteins / genetics
  • RNA-Binding Proteins / metabolism
  • Saccharomyces cerevisiae / genetics*
  • Saccharomyces cerevisiae / metabolism
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / metabolism
  • Transcription Factors / genetics
  • Transcription Factors / metabolism
  • Transcription, Genetic

Substances

  • DNA, Fungal
  • NRD1 protein, S cerevisiae
  • Nucleosomes
  • RNA, Messenger
  • RNA-Binding Proteins
  • Saccharomyces cerevisiae Proteins
  • Transcription Factors
  • DNA