Cancer subtype prediction from a pathway-level perspective by using a support vector machine based on integrated gene expression and protein network

Comput Methods Programs Biomed. 2017 Apr:141:27-34. doi: 10.1016/j.cmpb.2017.01.006. Epub 2017 Jan 20.

Abstract

Background and objective: Distinguishing cancer subtypes is critical for selecting the appropriate treatment strategy. Bioinformatics approaches have gradually taken the place of clinical observations and pathological experiments. However, these approaches are typically only used in gene expression profiling. Previous studies have primarily focused on the gene level or specific diseases, and thus pathway-level factors have not been considered. Therefore, a computational method that integrates gene expression and pathway is necessary.

Methods: This study presented an approach to determine potential fragments of activated pathways around protein networks in different stages of disease. We used a scored equation that integrates genomic and proteomic information and determined the intensity of the pathway link change. A support vector machine (SVM) was used to train and test subtype-predicted models.

Results: The performance of the proposed method was evaluated by calculating prediction accuracy. The average prediction accuracy was 67.64% for three subtypes in tumors of neuroepithelial tissues. The results demonstrate that the proposed method applies fewer features than gene expression methods used to obtain similar results CONCLUSIONS: This study suggests a method to implement a cancer subtype classifier based on an SVM from a pathway-level perspective.

Keywords: Cancer subtype; Computational method; Gene expression; Neuroepithelial tumor; Protein–protein interaction; Signaling pathway.

MeSH terms

  • Algorithms
  • Gene Expression*
  • Humans
  • Neoplasms / classification*
  • Neoplasms / genetics
  • Neoplasms / metabolism
  • Proteins / metabolism*
  • Support Vector Machine*

Substances

  • Proteins