Predicting human microRNA-disease associations based on support vector machine

Int J Data Min Bioinform. 2013;8(3):282-93. doi: 10.1504/ijdmb.2013.056078.

Abstract

The identification of disease-related microRNAs is vital for understanding the pathogenesis of disease at the molecular level and may lead to the design of specific molecular tools for diagnosis, treatment and prevention. Experimental identification of disease-related microRNAs poses difficulties. Computational prediction of microRNA-disease associations is one of the complementary means. However, one major issue in microRNA studies is the lack of bioinformatics programs to accurately predict microRNA-disease associations. Herein, we present a machine-learning-based approach for distinguishing positive microRNA-disease associations from negative microRNA-disease associations. A set of features was extracted for each positive and negative microRNA-disease association, and a Support Vector Machine (SVM) classifier was trained, which achieved the area under the ROC curve of up to 0.8884 in 10-fold cross-validation procedure, indicating that the SVM-based approach described here can be used to predict potential microRNA-disease associations and formulate testable hypotheses to guide future biological experiments.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Disease / genetics
  • Humans
  • MicroRNAs / chemistry*
  • ROC Curve
  • Support Vector Machine*

Substances

  • MicroRNAs