Cohesion group approach for evolutionary analysis of TyrA, a protein family with wide-ranging substrate specificities

Microbiol Mol Biol Rev. 2008 Mar;72(1):13-53, table of contents. doi: 10.1128/MMBR.00026-07.

Abstract

Many enzymes and other proteins are difficult subjects for bioinformatic analysis because they exhibit variant catalytic, structural, regulatory, and fusion mode features within a protein family whose sequences are not highly conserved. However, such features reflect dynamic and interesting scenarios of evolutionary importance. The value of experimental data obtained from individual organisms is instantly magnified to the extent that given features of the experimental organism can be projected upon related organisms. But how can one decide how far along the similarity scale it is reasonable to go before such inferences become doubtful? How can a credible picture of evolutionary events be deduced within the vertical trace of inheritance in combination with intervening events of lateral gene transfer (LGT)? We present a comprehensive analysis of a dehydrogenase protein family (TyrA) as a prototype example of how these goals can be accomplished through the use of cohesion group analysis. With this approach, the full collection of homologs is sorted into groups by a method that eliminates bias caused by an uneven representation of sequences from organisms whose phylogenetic spacing is not optimal. Each sufficiently populated cohesion group is phylogenetically coherent and defined by an overall congruence with a distinct section of the 16S rRNA gene tree. Occasional exceptions implicate a clearly defined LGT scenario in which the recipient lineage is apparent and the donor lineage of the transferred gene is localized to those organisms that define the cohesion group. Systematic procedures to manage and organize otherwise overwhelming amounts of data are demonstrated.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Amino Acid Sequence
  • Bacteria, Anaerobic / enzymology
  • Bacteria, Anaerobic / genetics
  • Bacterial Proteins / chemistry*
  • Bacterial Proteins / classification*
  • Bacterial Proteins / genetics
  • Coenzymes / classification
  • Coenzymes / genetics
  • Coenzymes / metabolism
  • Computational Biology / methods*
  • Evolution, Molecular*
  • Gene Transfer, Horizontal
  • Molecular Sequence Data
  • Multienzyme Complexes / chemistry*
  • Multienzyme Complexes / classification*
  • Multienzyme Complexes / genetics
  • Phylogeny*
  • Substrate Specificity
  • Tyrosine / biosynthesis
  • Tyrosine / genetics

Substances

  • Bacterial Proteins
  • Coenzymes
  • Multienzyme Complexes
  • TyrA protein, Bacteria
  • Tyrosine