A culture-independent approach, supervised machine learning, and the characterization of the microbial community composition of coastal areas across the Bay of Bengal and the Arabian Sea

BMC Microbiol. 2024 May 10;24(1):162. doi: 10.1186/s12866-024-03295-4.

Abstract

Background: Coastal areas are subject to various anthropogenic and natural influences. In this study, we investigated and compared the characteristics of two coastal regions, Andhra Pradesh (AP) and Goa (GA), focusing on pollution, anthropogenic activities, and recreational impacts. We explored three main factors influencing the differences between these coastlines: the Bay of Bengal's shallower depth and lower salinity; upwelling phenomena due to the thermocline in the Arabian Sea; and high tides, which can cause strong currents that transport pollutants and debris.

Results: The microbial diversity in GA was significantly higher than that in AP, which might be attributed to differences in temperature, soil type, and vegetation cover. 16S rRNA amplicon sequencing and bioinformatics analysis indicated the presence of diverse microbial phyla, including candidate phyla radiation (CPR). Statistical analysis, random forest regression, and classification with supervised machine learning models accurately confirmed the diversity of the microbiome. Furthermore, we identified 450 cultures of heterotrophic, biotechnologically important bacteria. Some strains were identified as novel taxa based on 16S rRNA gene sequencing, showing promising potential for further study.
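
As an illustration of how diversity in two samples might be compared from OTU read counts, the following is a minimal sketch using the Shannon index. The abstract does not state which diversity metric was used, so the choice of the Shannon index is an assumption; the function name `shannon_index` and the read counts are hypothetical and are not data from the study.

```python
import math


def shannon_index(otu_counts):
    """Shannon diversity H' = -sum(p_i * ln(p_i)) over OTUs with nonzero counts.

    Assumption: this metric is chosen for illustration only; the study's
    actual diversity measure is not given in the abstract.
    """
    total = sum(otu_counts)
    if total <= 0:
        raise ValueError("need at least one observed read")
    h = 0.0
    for count in otu_counts:
        if count > 0:
            p = count / total  # relative abundance of this OTU
            h -= p * math.log(p)
    return h


# Hypothetical OTU read counts, for illustration only.
even_sample = [25, 25, 25, 25]   # reads spread evenly over four OTUs
skewed_sample = [97, 1, 1, 1]    # one OTU dominates

print(shannon_index(even_sample) > shannon_index(skewed_sample))
```

A sample whose reads are spread evenly across OTUs scores higher than one dominated by a single OTU, which is the sense in which one site can be called more diverse than another.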

Conclusion: Our study provides valuable insights into the microbial diversity and pollution levels of coastal areas in AP and GA. These findings contribute to a better understanding of the impact of anthropogenic activities and climate variations on the biology and biodiversity of coastal ecosystems.

Keywords: Amplicon sequencing; Coastal marine microbiome; Metagenomics; Microbial composition and function; Operational taxonomic unit; Supervised machine learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria* / classification
  • Bacteria* / genetics
  • Bacteria* / isolation & purification
  • Bays* / microbiology
  • Biodiversity
  • DNA, Bacterial / genetics
  • India
  • Microbiota* / genetics
  • Phylogeny*
  • RNA, Ribosomal, 16S* / genetics
  • Salinity
  • Seawater* / microbiology
  • Sequence Analysis, DNA / methods
  • Supervised Machine Learning*

Substances

  • RNA, Ribosomal, 16S
  • DNA, Bacterial