Genome-wide profiling of untranslated regions by paired-end ditag sequencing reveals unexpected transcriptome complexity in yeast

Mol Genet Genomics. 2015 Feb;290(1):217-24. doi: 10.1007/s00438-014-0913-6. Epub 2014 Sep 12.

Abstract

The identification of structural and functional elements encoded in a genome is a challenging task. Although the transcriptome of budding yeast has been extensively analyzed, the boundaries and untranslated regions of yeast genes remain elusive. To address this least-explored field of yeast genomics, we performed a transcript profiling analysis through paired-end ditag (PET) approach coupled with deep sequencing. With 562,133 PET sequences we accurately defined the boundaries and untranslated regions of 3,409 ORFs, suggesting many yeast genes have multiple transcription start sites (TSSs). We also identified 85 previously uncharacterized transcripts either in intergenic regions or from the opposite strand of reported genomic features. Furthermore, our data revealed the extensive 3' end heterogeneity of yeast genes and identified a novel putative motif for polyadenylation. Our results indicate the yeast transcriptome is more complex than expected. This study would serve as an invaluable resource for elucidating the regulation and evolution of yeast genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions
  • 5' Untranslated Regions
  • Amino Acid Sequence
  • Gene Expression Profiling*
  • Gene Expression Regulation, Fungal
  • Genome, Fungal / genetics*
  • Molecular Sequence Data
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Saccharomyces cerevisiae / genetics*
  • Sequence Analysis, DNA / methods*
  • Transcription Initiation Site
  • Transcriptome / genetics*
  • Untranslated Regions / genetics*

Substances

  • 3' Untranslated Regions
  • 5' Untranslated Regions
  • RNA, Messenger
  • Untranslated Regions