RFW captures species-level metagenomic functions by integrating genome annotation information

Cell Rep Methods. 2024 Dec 16;4(12):100932. doi: 10.1016/j.crmeth.2024.100932. Epub 2024 Dec 10.

Abstract

Functional profiling of whole-metagenome shotgun sequencing (WMS) enables our understanding of microbe-host interactions. We demonstrate microbial functional information loss by current annotation methods at both the taxon and community levels, particularly at lower read depths. To address information loss, we develop a framework, RFW (reference-based functional profile inference on WMS), that utilizes information from genome functional annotations and taxonomic profiles to infer microbial function abundances from WMS. Furthermore, we provide an algorithm for absolute abundance change quantification between groups as part of the RFW framework. By applying RFW to several datasets related to autism spectrum disorder and colorectal cancer, we show that RFW augments downstream analyses, such as differential microbial function identification and association analysis between microbial function and host phenotype. RFW is open source and freely available at https://github.com/Xingyinliu-Lab/RFW.

Keywords: CP: Genetics; CP: Microbiology; DFSCA-BC; RFW; abundance change quantification of microbial function; microbial functional profiling; shotgun sequencing; species-level metagenomic functions.

MeSH terms

  • Algorithms*
  • Autism Spectrum Disorder / genetics
  • Colorectal Neoplasms / genetics
  • Humans
  • Metagenome / genetics
  • Metagenomics* / methods
  • Molecular Sequence Annotation*
  • Software