oCEM: Automatic detection and analysis of overlapping co-expressed gene modules

BMC Genomics. 2022 Jan 8;23(1):39. doi: 10.1186/s12864-021-08072-5.

Abstract

Background: When it comes to the co-expressed gene module detection, its typical challenges consist of overlap between identified modules and local co-expression in a subset of biological samples. The nature of module detection is the use of unsupervised clustering approaches and algorithms. Those methods are advanced undoubtedly, but the selection of a certain clustering method for sample- and gene-clustering tasks is separate, in which the latter task is often more complicated.

Results: This study presented an R-package, Overlapping CoExpressed gene Module (oCEM), armed with the decomposition methods to solve the challenges above. We also developed a novel auxiliary statistical approach to select the optimal number of principal components using a permutation procedure. We showed that oCEM outperformed state-of-the-art techniques in the ability to detect biologically relevant modules additionally.

Conclusions: oCEM helped non-technical users easily perform complicated statistical analyses and then gain robust results. oCEM and its applications, along with example data, were freely provided at https://github.com/huynguyen250896/oCEM .

Keywords: Analysis of modules; Clinical feature association; Co-expression; Gene expression; Identification of modules.

MeSH terms

  • Algorithms*
  • Cluster Analysis
  • Gene Expression Profiling
  • Gene Regulatory Networks*