A compressed sensing based approach for subtyping of leukemia from gene expression data

J Bioinform Comput Biol. 2011 Oct;9(5):631-45. doi: 10.1142/s0219720011005689.

Abstract

With the development of genomic techniques, the demand for new methods that can effectively handle high-throughput genome-wide data is stronger than ever. Compressed sensing (CS) is an emerging approach in statistics and signal processing. Under CS theory, a signal can be uniquely reconstructed or approximated from its sparse representation, and that representation can in turn help distinguish different types of signals. However, the application of the CS approach to genome-wide data analysis has rarely been investigated. We propose a novel CS-based approach for genomic data classification and test its performance in the subtyping of leukemia through gene expression analysis. Detecting subtypes of cancers such as leukemia according to their different genetic makeups is important because it holds promise for individualizing therapies and improving treatments. In our work, four statistical features were employed to select significant genes for the classification. Using genes selected from a pool of 7,129, the proposed CS method achieved a classification accuracy of 97.4% when evaluated by cross-validation and 94.3% when evaluated on a separate, independent data set. The robustness of the method to noise was also tested, and the method performed well. This work therefore demonstrates that the CS method can effectively detect subtypes of leukemia, which suggests that it could improve the accuracy of leukemia diagnosis.
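To make the idea of classifying through a sparse representation concrete, the sketch below codes a test expression profile as a sparse combination of training profiles and assigns it to the class whose training samples reconstruct it with the smallest residual. It is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the sparse solver or the decision rule, so orthogonal matching pursuit and a class-wise residual rule are assumed here. The function names (`omp`, `src_classify`, `make_toy_data`) and the synthetic two-class data are hypothetical, and the paper's gene-selection step is not reproduced.

```python
import numpy as np


def omp(D, y, n_nonzero):
    """Orthogonal matching pursuit: approximate y using at most
    n_nonzero columns of D (assumed solver, not taken from the paper)."""
    residual = y.copy()
    support = []
    coef = np.zeros(D.shape[1])
    sol = np.zeros(0)
    for _ in range(n_nonzero):
        # Pick the unused column most correlated with the current residual.
        correlations = np.abs(D.T @ residual)
        if support:
            correlations[support] = 0.0
        support.append(int(np.argmax(correlations)))
        # Re-fit all selected columns jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    coef[support] = sol
    return coef


def src_classify(D, labels, y, n_nonzero=5):
    """Assign y to the class whose training columns best reconstruct it."""
    labels = np.asarray(labels)
    # Normalize columns so correlations are comparable across samples.
    D = D / np.linalg.norm(D, axis=0, keepdims=True)
    coef = omp(D, y, n_nonzero)
    residuals = {}
    for c in np.unique(labels):
        mask = labels == c
        # Reconstruct y using only the coefficients belonging to class c.
        residuals[int(c)] = np.linalg.norm(y - D[:, mask] @ coef[mask])
    return min(residuals, key=residuals.get)


def make_toy_data(seed=0, n_genes=50, n_per_class=20, noise=0.3):
    """Synthetic stand-in for expression data: two classes (0 and 1),
    each with elevated expression in a different half of the genes."""
    rng = np.random.default_rng(seed)
    half = n_genes // 2
    means = np.zeros((2, n_genes))
    means[0, :half] = 2.0
    means[1, half:] = 2.0
    columns, labels = [], []
    for c in (0, 1):
        for _ in range(n_per_class):
            columns.append(means[c] + noise * rng.normal(size=n_genes))
            labels.append(c)
    # Columns of D are training samples; rows are genes.
    return np.column_stack(columns), np.array(labels), means


if __name__ == "__main__":
    D, labels, means = make_toy_data()
    rng = np.random.default_rng(1)
    test_sample = means[1] + 0.3 * rng.normal(size=means.shape[1])
    print("predicted class:", src_classify(D, labels, test_sample))
```

In this sketch the training samples themselves serve as the dictionary, so no separate model is trained; classification accuracy would then be estimated by holding samples out, as in the cross-validation the abstract reports.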

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Validation Study

MeSH terms

  • Computational Biology / methods
  • Databases, Genetic
  • Gene Expression Profiling / statistics & numerical data
  • Genome-Wide Association Study / statistics & numerical data
  • Humans
  • Leukemia / classification*
  • Leukemia / genetics*
  • Leukemia, Myeloid, Acute / classification
  • Leukemia, Myeloid, Acute / genetics
  • Models, Genetic
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / classification
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / genetics