Comparison of data-merging methods with SVM attribute selection and classification in breast cancer gene expression

BMC Bioinformatics. 2012 May 8;13 Suppl 7(Suppl 7):S9. doi: 10.1186/1471-2105-13-S7-S9.

Abstract

Background: DNA microarray data are used to identify genes which could be considered prognostic markers. However, due to the limited sample size of each study, the signatures are unstable in terms of the composing genes and may be limited in terms of performances. It is therefore of great interest to integrate different studies, thus increasing sample size.

Results: In the past, several studies explored the issue of microarray data merging, but the arrival of new techniques and a focus on SVM based classification needed further investigation. We used distant metastasis prediction based on SVM attribute selection and classification to three breast cancer data sets.

Conclusions: The results showed that breast cancer classification does not benefit from data merging, confirming the results found by other studies with different techniques.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms / genetics*
  • Female
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Neoplasm Metastasis / genetics
  • Oligonucleotide Array Sequence Analysis
  • Support Vector Machine*