The current practice of determining histologic grade with a single molecular biomarker can facilitate differential diagnosis but cannot predict the risk of lesion progression. Cancer is caused by complex mechanisms, and no single biomarker can both make accurate diagnoses and predict progression risk. Modelling using multiple biomarkers can be used to derive scores for risk prediction. Mathematical models (MMs) may be capable of making predictions from biomarker data. Therefore, this study aimed to develop MM-based scores for predicting the risk of precancerous cervical lesion progression and identifying precancerous lesions in patients in northern Thailand by evaluating the expression of multiple biomarkers. The MMs (Models 1-5) were developed in the test sample set based on patient age range (five categories) and biomarker levels (cortactin, p16INK4A, and Ki-67 by immunohistochemistry [IHC], and HPV E6/E7 ribonucleic acid (RNA) by in situ hybridization [ISH]). The risk scores for the prediction of cervical lesion progression ("risk biomolecules") ranged from 2.56-2.60 in the normal and low-grade squamous intraepithelial lesion (LSIL) cases and from 3.54-3.62 in cases where precancerous lesions were predicted to progress. In Model 4, 23/86 (26.7%) normal and LSIL cases had biomolecule levels that suggested a risk of progression, while 5/86 (5.8%) cases were identified as precancerous lesions. Additionally, histologic grading with a single molecular biomarker did not identify 23 cases with risk, preventing close patient monitoring. These results suggest that biomarker level-based risk scores are useful for predicting the risk of cervical lesion progression and identifying precancerous lesion development. This multiple biomarker-based strategy may ultimately have utility for predicting cancer progression in other contexts.
Keywords: HPV; biomarker; cervical cancer; machine learning; mathematical model; risk score.