Evaluation of Machine Learning Methods to Predict Coronary Artery Disease Using Metabolomic Data

Stud Health Technol Inform. 2017:235:111-115.

Abstract

Metabolomic data can potentially enable accurate, non-invasive and low-cost prediction of coronary artery disease. Regression-based analytical approaches however might fail to fully account for interactions between metabolites, rely on a priori selected input features and thus might suffer from poorer accuracy. Supervised machine learning methods can potentially be used in order to fully exploit the dimensionality and richness of the data. In this paper, we systematically implement and evaluate a set of supervised learning methods (L1 regression, random forest classifier) and compare them to traditional regression-based approaches for disease prediction using metabolomic data.

Keywords: EHR; coronary artery disease; machine learning; random forest.

Publication types

  • Evaluation Study

MeSH terms

  • Aged
  • Coronary Artery Disease / diagnosis*
  • Coronary Artery Disease / metabolism*
  • Female
  • Humans
  • Logistic Models
  • Machine Learning*
  • Male
  • Metabolomics*
  • Middle Aged
  • Models, Theoretical