MOSCATO: a supervised approach for analyzing multi-Omic single-Cell data

BMC Genomics. 2022 Aug 4;23(1):557. doi: 10.1186/s12864-022-08759-3.

Abstract

Background: Advancements in genomic sequencing continually improve personalized medicine, and recent breakthroughs generate multimodal data on a cellular level. We introduce MOSCATO, a technique for selecting features across multimodal single-cell datasets that relate to clinical outcomes. We summarize the single-cell data using tensors and perform regularized tensor regression to return clinically-associated variable sets for each 'omic' type.

Results: Robustness was assessed over simulations based on available single-cell simulation methods, and applicability was assessed through an example using CITE-seq data to detect genes associated with leukemia. We find that MOSCATO performs favorably in selecting network features while also shown to be applicable to real multimodal single-cell data.

Conclusions: MOSCATO is a useful analytical technique for supervised feature selection in multimodal single-cell data. The flexibility of our approach enables future extensions on distributional assumptions and covariate adjustments.

Keywords: Multi-omics; Multimodal; Network analysis; Single-cell sequencing; Tensor regression.

MeSH terms

  • Precision Medicine*
  • Single-Cell Analysis*