Evaluating prediction model performance

Surgery. 2023 Sep;174(3):723-726. doi: 10.1016/j.surg.2023.05.023. Epub 2023 Jul 5.

Abstract

This article highlights important performance metrics to consider when evaluating models developed for supervised classification or regression tasks using clinical data. When evaluating model performance, we detail the basics of confusion matrices, receiver operating characteristic curves, F1 scores, precision-recall curves, mean squared error, and other considerations. In this era, defined by the rapid proliferation of advanced prediction models, familiarity with various performance metrics beyond the area under the receiver operating characteristic curves and the nuances of evaluating model value upon implementation is essential to ensure effective resource allocation and optimal patient care delivery.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Delivery of Health Care*
  • Humans
  • Models, Theoretical
  • ROC Curve*
  • Resource Allocation