Critical evaluation of the FANTOM3 non-coding RNA transcripts

Karl J V Nordström; Majd A I Mirza; Markus Sällman Almén; David E Gloriam; Robert Fredriksson; Helgi B Schiöth

doi:10.1016/j.ygeno.2009.05.012

Critical evaluation of the FANTOM3 non-coding RNA transcripts

Genomics. 2009 Sep;94(3):169-76. doi: 10.1016/j.ygeno.2009.05.012. Epub 2009 Jun 6.

Authors

Karl J V Nordström¹, Majd A I Mirza, Markus Sällman Almén, David E Gloriam, Robert Fredriksson, Helgi B Schiöth

Affiliation

¹ Department of Neuroscience, Uppsala University, Sweden. karl.nordstrom@neuro.uu.se

PMID: 19505569
DOI: 10.1016/j.ygeno.2009.05.012

Abstract

We studied the genomic positions of 38,129 putative ncRNAs from the RIKEN dataset in relation to protein-coding genes. We found that the dataset has 41% sense, 6% antisense, 24% intronic and 29% intergenic transcripts. Interestingly, 17,678 (47%) of the FANTOM3 transcripts were found to potentially be internally primed from longer transcripts. The highest fraction of these transcripts was found among the intronic transcripts and as many as 77% or 6929 intronic transcripts were both internally primed and unspliced. We defined a filtered subset of 8535 transcripts that did not overlap with protein-coding genes, did not contain ORFs longer than 100 residues and were not internally primed. This dataset contains 53% of the FANTOM3 transcripts associated to known ncRNA in RNAdb and expands previous similar efforts with 6523 novel transcripts. This bioinformatic filtering of the FANTOM3 non-coding dataset has generated a lead dataset of transcripts without signs of being artefacts, providing a suitable dataset for investigation with hybridization-based techniques.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology
Databases, Genetic*
Expressed Sequence Tags
Genome, Human
Humans
Introns / genetics
Proteins / genetics
RNA, Messenger / genetics
RNA, Untranslated / genetics*
Sequence Analysis, RNA
Transcription, Genetic*

Substances

Proteins
RNA, Messenger
RNA, Untranslated