Methods for multi-omic data integration in cancer research

Front Genet. 2024 Sep 19:15:1425456. doi: 10.3389/fgene.2024.1425456. eCollection 2024.

Abstract

Multi-omics data integration is a term that refers to the process of combining and analyzing data from different omic experimental sources, such as genomics, transcriptomics, methylation assays, and microRNA sequencing, among others. Such data integration approaches have the potential to provide a more comprehensive functional understanding of biological systems and has numerous applications in areas such as disease diagnosis, prognosis and therapy. However, quantitative integration of multi-omic data is a complex task that requires the use of highly specialized methods and approaches. Here, we discuss a number of data integration methods that have been developed with multi-omics data in view, including statistical methods, machine learning approaches, and network-based approaches. We also discuss the challenges and limitations of such methods and provide examples of their applications in the literature. Overall, this review aims to provide an overview of the current state of the field and highlight potential directions for future research.

Keywords: LASSO; cancer biology; data integration; multi-omics; regulatory models; statistical and probabilistic modelling.

Publication types

  • Review

Grants and funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was supported by Intramural funds from the National Institute of Genomic Medicine.