Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease affecting motor neurons. Although genes causing familial cases have been identified, those of sporadic ALS, which occupies the majority of patients, are still elusive. In this study, we adopted machine learning to build binary classifiers based on the New York Genome Center (NYGC) ALS Consortium's RNA-seq data of the postmortem spinal cord of ALS and non-neurological disease control. The accuracy of the classifiers was greater than 83% and 77% for the training set and the unseen test set, respectively. The classifiers contained 114 genes. Among them, 41 genes have been reported in previous ALS studies, and others are novel in this field. These genes are involved in mitochondrial respiration, lipid metabolism, endosomal trafficking, and iron metabolism, which may promote the progression of ALS pathology.
Keywords: ALS; Machine learning; RNA-seq; Spinal cord.
© 2025. The Author(s).