Hemodialysis (HD) is the main treatment for end-stage renal disease with high mortality and heavy economic burdens. Predicting the mortality risk in patients undergoing maintenance HD and identifying high-risk patients are critical to enable early intervention and improve quality of life. In this study, we proposed a two-stage protocol based on electronic health record (EHR) data to predict mortality risk of maintenance HD patients. First, we developed a multilayer perceptron (MLP) model to predict mortality risk. Second, an Active Contrastive Learning (ACL) method was proposed to select sample pairs and optimize the representation space to improve the prediction performance of the MLP model. Our ACL method outperforms other methods and has an average F1-score of 0.820 and an average area under the receiver operating characteristic curve of 0.853. This work is generalizable to analyses of cross-sectional EHR data, while this two-stage approach can be applied to other diseases as well.
Keywords: Hemodialysis; active contrastive learning; electronic health records; mortality risk prediction.