Analyses of the structural organization of unidentified open reading frames from metagenome

Biochem Biophys Res Commun. 2007 May 18;356(4):961-7. doi: 10.1016/j.bbrc.2007.03.090. Epub 2007 Mar 26.

Abstract

Activity-based screening methods can select positive clones from a metagenomic library without any sequence information. However, the frequency of positive hits is low because proteins are improperly expressed in the cloning host Escherichia coli, and this frequency might be improved. To investigate whether the metagenome can be expressed in E. coli, the structural organization of unidentified open reading frames (URFs) from the metagenome was analyzed in terms of transcription and translation factors and compared to that of 4300 ORFs of E. coli K12. Considerable differences in amino acid composition and codon usage were found between the metagenome URFs and the E. coli ORFs, reflecting a barrier to protein expression within the host E. coli. Analyses of the promoter and RBS regions showed that the sequences or patterns in the corresponding regions of the metagenome URFs were dissimilar to the E. coli consensus. These results suggest that these factors should be taken into account when screening clones from a metagenomic library with the activity-based approach.
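The codon usage comparison described above can be illustrated with a minimal Python sketch. This is not the authors' procedure: the abstract does not state how the differences were quantified, so the function names `codon_frequencies` and `usage_distance` and the choice of total variation distance as the dissimilarity measure are assumptions made for illustration only. The sketch assumes in-frame DNA coding sequences given as plain strings.

```python
from collections import Counter
from itertools import product

# All 64 DNA codons, used to ignore triplets containing ambiguous bases.
CODONS = ["".join(p) for p in product("ACGT", repeat=3)]
VALID = set(CODONS)


def codon_frequencies(sequences):
    """Return the relative frequency of each codon over a set of in-frame sequences."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        usable = len(seq) - len(seq) % 3  # drop a trailing partial codon
        for i in range(0, usable, 3):
            codon = seq[i:i + 3]
            if codon in VALID:
                counts[codon] += 1
    total = sum(counts.values())
    if total == 0:
        return {codon: 0.0 for codon in CODONS}
    return {codon: counts[codon] / total for codon in CODONS}


def usage_distance(freq_a, freq_b):
    """Total variation distance between two codon frequency tables (0 = identical, 1 = disjoint).

    Illustrative choice of measure, not taken from the paper.
    """
    return 0.5 * sum(abs(freq_a[c] - freq_b[c]) for c in CODONS)
```

With two collections of coding sequences, for example a list of metagenome URFs and a list of E. coli K12 ORFs loaded by the reader, `usage_distance(codon_frequencies(urfs), codon_frequencies(ecoli_orfs))` gives a single number summarizing how far apart the two codon usage profiles are. The same counting pattern applies to amino acid composition once the sequences are translated.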

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Consensus Sequence
  • Escherichia coli K12 / genetics*
  • Genome, Bacterial / genetics*
  • Molecular Sequence Data
  • Open Reading Frames / genetics*
  • Promoter Regions, Genetic / genetics*
  • Proteome / genetics*
  • Sequence Homology, Nucleic Acid

Substances

  • Proteome