A hybrid Forecast Cost Benefit Classification of diabetes mellitus prevalence based on epidemiological study on Real-life patient's data

Sci Rep. 2019 Jul 12;9(1):10103. doi: 10.1038/s41598-019-46631-9.

Abstract

The increasing ratio of diabetes is found risky across the planet. Therefore, the diagnosis is important in population with extreme risk of diabetes. In this study, a decision-making classifier (J48) is applied over a data-mining platform (Weka) to measure accuracy and linear regression on classification results to forecast cost/benefit ratio in diabetes mellitus patients along with prevalence. In total 108 invasive and non-invasive medical features are considered from 251 patients for assessment, and the real-time data are gathered from Pakistan over a time span of June 2017 to April 2018. The results indicate that J48 classifiers achieved the best accuracy of (99.28%), whereas, error rate (0.08%), Kappa stats, PRC, and MCC are (0.98%), precision, recall, and F-matrix are (0.99%). In addition, true positive rate is (0.99%) and false positive is (0.08%). The regression forecast decision indicates blood pressure and glucose level are key features for diabetes. The cost/benefit matrix indicates two predictions for positive test with accuracy (66.68%) and (30.60%), and key attributes with total Gain (118.13%). The study confirmed the proposed prediction is practical for screening of diabetes mellitus patients at the initial stage without invasive medical tests and found effectual in the early diagnosis of diabetes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Blood Glucose / analysis
  • Blood Pressure / physiology
  • Clinical Decision-Making
  • Cost-Benefit Analysis / methods*
  • Data Mining / methods*
  • Decision Support Systems, Clinical
  • Diabetes Mellitus / diagnosis*
  • Diabetes Mellitus / economics*
  • Diabetes Mellitus / therapy
  • Epidemiologic Studies
  • Forecasting / methods
  • Humans
  • Machine Learning

Substances

  • Blood Glucose