Background: Although differentiation therapy can cure some hematologic malignancies, its curative potential remains unrealized in solid tumors. This is because conventional computational approaches succumb to the thunderous noise of inter-/intratumoral heterogeneity. Using colorectal cancers (CRCs) as an example, here we outline a machine learning(ML)-based approach to track, differentiate, and selectively target cancer stem cells (CSCs).
Methods: A transcriptomic network was built and validated using healthy colon and CRC tissues in diverse gene expression datasets (~5,000 human and >300 mouse samples). Therapeutic targets and perturbation strategies were prioritized using ML, with the goal of reinstating the expression of a transcriptional identifier of the differentiated colonocyte, CDX2, whose loss in poorly differentiated (CSC-enriched) CRCs doubles the risk of relapse/death. The top candidate target was then engaged with a clinical-grade drug and tested on 3 models: CRC lines in vitro, xenografts in mice, and in a prospective cohort of healthy (n = 3) and CRC (n = 23) patient-derived organoids (PDOs).
Results: The drug shifts the network predictably, induces CDX2 and crypt differentiation, and shows cytotoxicity in all 3 models, with a high degree of selectivity towards all CDX2-negative cell lines, xenotransplants, and PDOs. The potential for effective pairing of therapeutic efficacy (IC50) and biomarker (CDX2-low state) is confirmed in PDOs using multivariate analyses. A 50-gene signature of therapeutic response is derived and tested on 9 independent cohorts (~1700 CRCs), revealing the impact of CDX2-reinstatement therapy could translate into a ~50% reduction in the risk of mortality/recurrence.
Conclusions: Findings not only validate the precision of the ML approach in targeting CSCs, and objectively assess its impact on clinical outcome, but also exemplify the use of ML in yielding clinical directive information for enhancing personalized medicine.