Prediction of Hepatocellular Carcinoma After Hepatitis C Virus Sustained Virologic Response Using a Random Survival Forest Model

Hikaru Nakahara; Atsushi Ono; C Nelson Hayes; Yuki Shirane; Ryoichi Miura; Yasutoshi Fujii; Serami Murakami; Kenji Yamaoka; Hauri Bao; Shinsuke Uchikawa; Hatsue Fujino; Eisuke Murakami; Tomokazu Kawaoka; Daiki Miki; Masataka Tsuge; Shiro Oka; Hiroshima Liver Study Group; TransSCOT Consortium

doi:10.1200/CCI.24.00108

Prediction of Hepatocellular Carcinoma After Hepatitis C Virus Sustained Virologic Response Using a Random Survival Forest Model

JCO Clin Cancer Inform. 2024 Dec:8:e2400108. doi: 10.1200/CCI.24.00108. Epub 2024 Dec 18.

Authors

Hikaru Nakahara^{1

2}, Atsushi Ono¹, C Nelson Hayes¹, Yuki Shirane¹, Ryoichi Miura¹, Yasutoshi Fujii^{1

3}, Serami Murakami¹, Kenji Yamaoka¹, Hauri Bao¹, Shinsuke Uchikawa¹, Hatsue Fujino¹, Eisuke Murakami¹, Tomokazu Kawaoka¹, Daiki Miki¹, Masataka Tsuge¹, Shiro Oka¹; Hiroshima Liver Study Group; TransSCOT Consortium

Affiliations

¹ Department of Gastroenterology, Graduate School of Biomedical & Health Sciences, Hiroshima University, Hiroshima, Japan.
² Department of Clinical and Molecular Genetics, Hiroshima University, Hiroshima, Japan.
³ Department of Clinical Oncology, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, Japan.

PMID: 39693579
DOI: 10.1200/CCI.24.00108

Abstract

Purpose: Postsustained virologic response (SVR) screening following clinical guidelines does not address individual risk of hepatocellular carcinoma (HCC). Our aim is to provide tailored screening for patients using machine learning to predict HCC incidence after SVR.

Methods: Using clinical data from 1,028 SVR patients, we developed an HCC prediction model using a random survival forest (RSF). Model performance was assessed using Harrel's c-index and validated in an independent cohort of 737 SVR patients. Shapley additive explanation (SHAP) facilitated feature quantification, whereas optimal cutoffs were determined using maximally selected rank statistics. We used Kaplan-Meier analysis to compare cumulative HCC incidence between risk groups.

Results: We achieved c-index scores and 95% CIs of 0.90 (0.85 to 0.94) and 0.80 (0.74 to 0.85) in the derivation and validation cohorts, respectively, in a model using platelet count, gamma-glutamyl transpeptidase, sex, age, and ALT. Stratification resulted in four risk groups: low, intermediate, high, and very high. The 5-year cumulative HCC incidence rates and 95% CIs for these groups were as follows: derivation: 0% (0 to 0), 3.8% (0.6 to 6.8), 26.2% (17.2 to 34.3), and 54.2% (20.2 to 73.7), respectively, and validation: 0.7% (0 to 1.6), 7.1% (2.7 to 11.3), 5.2% (0 to 10.8), and 28.6% (0 to 55.3), respectively.

Conclusion: The integration of RSF and SHAP enabled accurate HCC risk classification after SVR, which may facilitate individualized HCC screening strategies and more cost-effective care.

MeSH terms

Aged
Antiviral Agents / therapeutic use
Carcinoma, Hepatocellular* / diagnosis
Carcinoma, Hepatocellular* / epidemiology
Carcinoma, Hepatocellular* / etiology
Carcinoma, Hepatocellular* / virology
Female
Hepacivirus / isolation & purification
Hepatitis C / complications
Hepatitis C / drug therapy
Hepatitis C / epidemiology
Hepatitis C / virology
Humans
Incidence
Kaplan-Meier Estimate
Liver Neoplasms* / diagnosis
Liver Neoplasms* / epidemiology
Liver Neoplasms* / etiology
Liver Neoplasms* / virology
Machine Learning
Male
Middle Aged
Prognosis
Risk Assessment / methods
Risk Factors
Sustained Virologic Response*

Substances

Antiviral Agents