Transcriptomic analyses reveal groups of co-expressed, syntenic lncRNAs in four species of the genus Caenorhabditis

RNA Biol. 2019 Mar;16(3):320-329. doi: 10.1080/15476286.2019.1572438. Epub 2019 Jan 31.

Abstract

Long non-coding RNAs (lncRNAs) are a heterogeneous class of genes that do not code for proteins. Since lncRNAs (or a fraction thereof) are expected to be functional, many efforts have been dedicated to catalog lncRNAs in numerous organisms, but our knowledge of lncRNAs in non vertebrate species remains very limited. Here, we annotated lncRNAs using transcriptomic data from the same larval stage of four Caenorhabditis species. The number of annotated lncRNAs in self-fertile nematodes was lower than in out-crossing species. We used a combination of approaches to identify putatively homologous lncRNAs: synteny, sequence conservation, and structural conservation. We classified a total of 1,532 out of 7,635 genes from the four species into families of lncRNAs with conserved synteny and expression at the larval stage, suggesting that a large fraction of the predicted lncRNAs may be species specific. Despite both sequence and local secondary structure seem to be poorly conserved, sequences within families frequently shared BLASTn hits and short sequence motifs, which were more likely to be unpaired in the predicted structures. We provide the first multi-species catalog of lncRNAs in nematodes and identify groups of lncRNAs with conserved synteny and expression, that share exposed motifs.

Keywords: Lncrna; motifs; secondary structure; synteny.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Caenorhabditis / classification
  • Caenorhabditis / genetics*
  • Computational Biology / methods
  • Evolution, Molecular
  • Gene Expression Profiling*
  • Gene Expression Regulation
  • Molecular Sequence Annotation
  • Nucleotide Motifs
  • RNA, Long Noncoding / chemistry
  • RNA, Long Noncoding / genetics*
  • Species Specificity
  • Transcriptome*

Substances

  • RNA, Long Noncoding

Grants and funding

This work was supported by the Spanish Ministry of Economy,Industry, and Competitiveness (MEIC), INB Grant (PT17/0009/0023 - ISCIII-SGEFI/ERDF) by the EMBL partnership, and grants ‘Centro de Excelencia Severo Ochoa 2013–2017’ SEV-2012-0208, and BFU2015-67107 cofounded by European Regional 710 Development Fund (ERDF); by the CERCA Programme/Generalitat de Catalunya; from the Catalan Research Agency (AGAUR) SGR857, and grant from the European Union’s Horizon 2020 research and innovation programme under the grant agreement ERC-2016-724173 the Marie Sklodowska-Curie grant agreement No 715 H2020-MSCA-ITN-2014-642095 and the Marie Skłodowska-Curie Actions [H2020-MSCA-IF-2017-793699].