Background and aims: Accurate risk stratification for hepatocellular carcinoma (HCC) after achieving a sustained viral response (SVR) is necessary for optimal surveillance. We aimed to develop and validate a machine learning (ML) model to predict the risk of HCC after achieving an SVR in individual patients.
Methods: In this multicenter cohort study, 1742 patients with chronic hepatitis C who achieved an SVR were enrolled. Five ML models were developed including DeepSurv, gradient boosting survival analysis, random survival forest (RSF), survival support vector machine, and a conventional Cox proportional hazard model. Model performance was evaluated using Harrel' c-index and was externally validated in an independent cohort (977 patients).
Results: During the mean observation period of 5.4 years, 122 patients developed HCC (83 in the derivation cohort and 39 in the external validation cohort). The RSF model showed the best discrimination ability using seven parameters at the achievement of an SVR with a c-index of 0.839 in the external validation cohort and a high discriminative ability when the patients were categorized into three risk groups (P <0.001). Furthermore, this RSF model enabled the generation of an individualized predictive curve for HCC occurrence for each patient with an app available online.
Conclusions: We developed and externally validated an RSF model with good predictive performance for the risk of HCC after an SVR. The application of this novel model is available on the website. This model could provide the data to consider an effective surveillance method. Further studies are needed to make recommendations for surveillance policies tailored to the medical situation in each country.
Impact and implications: A novel prediction model for HCC occurrence in patients after hepatitis C virus eradication was developed using machine learning algorithms. This model, using seven commonly measured parameters, has been shown to have a good predictive ability for HCC development and could provide a personalized surveillance system.
Keywords: Hepatitis C; Hepatocellular carcinoma; Machine learning; SVR; Surveillance.
Copyright © 2023 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.