Bioinformatics-Based Identification of Key Prognostic Genes in Neuroblastoma with a Focus on Immune Cell Infiltration and Diagnostic Potential of VGF

Pharmgenomics Pers Med. 2024 Oct 10:17:453-472. doi: 10.2147/PGPM.S461072. eCollection 2024.

Abstract

Objective: This study aims to identify differentially expressed genes (DEGs) in neuroblastoma (NB) through comprehensive bioinformatics analysis and machine learning techniques. We seek to elucidate these DEGs' biological functions and associated signaling pathways. Furthermore, our objective extends to predicting upstream microRNAs (miRNAs) and relevant transcription factors of pivotal genes, with the ultimate goal of guiding clinical diagnostics and informing future treatment strategies for Neuroblastoma.

Methods: In this study, we sourced datasets GSE49710 and TARGET from the GEO and UCSC-XENA databases, respectively. Differentially expressed genes (DEGs) were identified using the R language "limma" package. Subsequent Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses of these DEGs were conducted using the "clusterProfiler" package. We employed Weighted Gene Co-expression Network Analysis (WGCNA) to isolate the most significant modules associated with death and MYCN amplification, specifically MEpink and MEbrown modules. These modules were then cross-referenced with the DEGs for further GO and KEGG pathway analyses. LASSO regression analysis, facilitated by the "glmnet" package, was utilized to pinpoint three hub genes. We performed differential analysis on these genes and constructed Receiver Operating Characteristic (ROC) curves for disease diagnosis purposes. Immune infiltration analysis was conducted using the "GSVA" package's ssGSEA function. Additionally, single-gene Gene Set Enrichment Analysis (GSEA) on the hub gene was carried out based on Reactome and KEGG databases. Upstream miRNA and transcription factors associated with the hub gene were predicted using RegNetwork, with visual representations created in Cytoscape. Furthermore, to validate the three identified markers in neuroblastoma tissues, quantitative Real-Time Polymerase Chain Reaction (qRT-PCR) analysis was conducted.

Results: We identified 483 differentially expressed genes (DEGs) in neuroblastoma. These genes predominantly function in protein translation, membrane composition, and RNA transcription regulation, and are implicated in multiple signaling pathways relevant to neurodegenerative diseases. Utilizing LASSO regression analysis, we pinpointed three hub genes: VGF, DGKD, and C19orf52. The Receiver Operating Characteristic (ROC) curve analysis yielded Area Under Curve (AUC) values of 0.751 and 0.722 for VGF, 0.79 and 0.656 for DGKD, and 0.8 and 0.753 for C19orf52, respectively. Our immune infiltration analysis revealed significant correlations among monocytes, follicular helper T cells, and CD4+ T cells. Notably, in the death group, we observed heightened infiltration levels of activated CD4+ T cells, macrophages, and Th2 cells. C19orf52 exhibited a close association with the infiltration of monocytes, CD4+ T cells, and Th2 cells, with P-values less than 0.05. Furthermore, qRT-PCR analysis corroborated the upregulation of VGF in neuroblastoma tissues, further validating our findings.

Conclusion: The hub genes (VGF, DGKD, and C19orf52) of neuroblastoma are screened. VGF, one of the hub genes, may have a high diagnostic value and is involved in the immune cell infiltration in neuroblastoma tissue, which may be used as a biomarker for the diagnosis of neuroblastoma and provides a new direction for clinical prognosis prediction and management improvement.

Keywords: LASSO regression; WGCNA; bioinformatics analysis; neuroblastoma.

Grants and funding

There is no funding to report.