CosGeneGate selects multi-functional and credible biomarkers for single-cell analysis

Brief Bioinform. 2024 Nov 22;26(1):bbae626. doi: 10.1093/bib/bbae626.

Abstract

Motivation: Selecting representative genes or marker genes to distinguish cell types is an important task in single-cell sequencing analysis. Although many methods have been proposed to select marker genes, the genes selected may have redundancy and/or do not show cell-type-specific expression patterns to distinguish cell types.

Results: Here, we present a novel model, named CosGeneGate, to select marker genes for more effective marker selections. CosGeneGate is inspired by combining the advantages of selecting marker genes based on both cell-type classification accuracy and marker gene specific expression patterns. We demonstrate the better performance of the marker genes selected by CosGeneGate for various downstream analyses than the existing methods with both public datasets and newly sequenced datasets. The non-redundant marker genes identified by CosGeneGate for major cell types and tissues in human can be found at the website as follows: https://github.com/VivLon/CosGeneGate/blob/main/marker gene list.xlsx.

Keywords: Alzheimer’s disease; deep learning; marker genes; single-cell sequencing.

MeSH terms

  • Biomarkers* / metabolism
  • Computational Biology / methods
  • Gene Expression Profiling / methods
  • Genetic Markers
  • Humans
  • Single-Cell Analysis* / methods
  • Software

Substances

  • Biomarkers
  • Genetic Markers