iRNA-2OM: A Sequence-Based Predictor for Identifying 2'-O-Methylation Sites in Homo sapiens

J Comput Biol. 2018 Nov;25(11):1266-1277. doi: 10.1089/cmb.2018.0004. Epub 2018 Aug 16.

Abstract

2'-O-methylation plays an important biological role in gene expression. Owing to the explosive increase in genomic sequencing data, it is necessary to develop a method for quickly and efficiently identifying whether a sequence contains the 2'-O-methylation site. As an additional method to the experimental technique, a computational method may help to identify 2'-O-methylation sites. In this study, based on the experimental 2'-O-methylation data of Homo sapiens, we proposed a support vector machine-based model to predict 2'-O-methylation sites in H. sapiens. In this model, the RNA sequences were encoded with the optimal features obtained from feature selection. In the fivefold cross-validation test, the accuracy reached 97.95%.

Keywords: 2′-O-methylation; Homo sapiens; PseKNC; RNA sequence; chemical property.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computational Biology / methods*
  • Genome, Human*
  • Humans
  • Methylation
  • RNA / chemistry*
  • RNA Interference
  • Software*
  • Support Vector Machine*

Substances

  • RNA