The Genome3D Consortium for Structural Annotations of Selected Model Organisms

Methods Mol Biol. 2020:2165:27-67. doi: 10.1007/978-1-0716-0708-4_3.

Abstract

Genome3D consortium is a collaborative project involving protein structure prediction and annotation resources developed by six world-leading structural bioinformatics groups, based in the United Kingdom (namely Blundell, Murzin, Gough, Sternberg, Orengo, and Jones). The main objective of Genome3D serves as a common portal to provide both predicted models and annotations of proteins in model organisms, using several resources developed by these labs such as CATH-Gene3D, DOMSERF, pDomTHREADER, PHYRE, SUPERFAMILY, FUGUE/TOCATTA, and VIVACE. These resources primarily use SCOP- and/or CATH-based protein domain assignments. Another objective of Genome3D is to compare structural classifications of protein domains in CATH and SCOP databases and to provide a consensus mapping of CATH and SCOP protein superfamilies. CATH/SCOP mapping analyses led to the identification of total of 1429 consensus superfamilies.Currently, Genome3D provides structural annotations for ten model organisms, including Homo sapiens, Arabidopsis thaliana, Mus musculus, Escherichia coli, Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster, Plasmodium falciparum, Staphylococcus aureus, and Schizosaccharomyces pombe. Thus, Genome3D serves as a common gateway to each structure prediction/annotation resource and allows users to perform comparative assessment of the predictions. It, thus, assists researchers to broaden their perspective on structure/function predictions of their query protein of interest in selected model organisms.

Keywords: Annotation; CATH; Fold recognition; Function prediction; Hidden Markov model; Homology modeling; Protein domain; Protein family; Protein structure prediction; Protein superfamily; SCOP; Superfamily mapping.

MeSH terms

  • Animals
  • Arabidopsis
  • Genome
  • Genomics / methods
  • Genomics / organization & administration*
  • Humans
  • Information Dissemination
  • Knowledge Bases*
  • Molecular Sequence Annotation / methods*
  • Proteome / chemistry*
  • Sequence Alignment / methods
  • United Kingdom
  • Yeasts

Substances

  • Proteome