Kernel methods for large-scale genomic data analysis

Brief Bioinform. 2015 Mar;16(2):183-92. doi: 10.1093/bib/bbu024. Epub 2014 Jul 22.

Abstract

Machine learning, particularly kernel methods, has been demonstrated as a promising new tool to tackle the challenges imposed by today's explosive data growth in genomics. They provide a practical and principled approach to learning how a large number of genetic variants are associated with complex phenotypes, to help reveal the complexity in the relationship between the genetic markers and the outcome of interest. In this review, we highlight the potential key role it will have in modern genomic data processing, especially with regard to integration with classical methods for gene prioritizing, prediction and data fusion.

Keywords: association test; kernel logistic regression; kernel methods; lasso; machine learning; prediction; structured mapping.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Computational Biology
  • Data Interpretation, Statistical
  • Genome-Wide Association Study / statistics & numerical data
  • Genomics / statistics & numerical data*
  • Humans
  • Logistic Models
  • Machine Learning*
  • Models, Statistical
  • Polymorphism, Single Nucleotide
  • Support Vector Machine