Analyses of non-leucine-rich repeat (non-LRR) regions intervening between LRRs in proteins

Biochim Biophys Acta. 2009 Oct;1790(10):1217-37. doi: 10.1016/j.bbagen.2009.06.014. Epub 2009 Jul 3.

Abstract

Background: Many proteins have LRR (leucine-rich repeat) units interrupted by non-LRRs which we call IR (non-LRR island region).

Methods: We identified proteins containing LRR@IRs (LRRs having IR) by using a new method and then analyzed their natures and distributions.

Results: LRR@IR proteins were found in over two hundred proteins from prokaryotes and from eukaryotes. These are divided into twenty-one different protein families. The IRs occur one to four times in LRR regions and range in length from 5 to 11,265 residues. The IR lengths in Fungi adenylate cyclases (acys) range from 5 to 116 residues; there are 22 LRR repeats. The IRs in Leishmania proteophosphoglycans (ppgs) vary from 105 to 11,265 residues. These results indicate that the IRs evolved rapidly. A group of LRR@IR proteins-LRRC17, chondroadherin-like protein, ppgs, and four Pseudomonas proteins-have a super motif consisting of an LRR block and its adjacent LRR@IR region. This indicates that the entire super motif experienced duplication. The sequence analysis of IRs offers functional similarity in some LRR@IR protein families.

General significance: This study suggests that various IRs and super motifs provide a great variety of structures and functions for LRRs.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Arabidopsis Proteins / genetics
  • Bacterial Proteins / genetics
  • Caenorhabditis elegans Proteins / genetics
  • Computational Biology / methods*
  • Databases, Protein*
  • Humans
  • Leucine-Rich Repeat Proteins
  • Membrane Proteins / genetics
  • Molecular Sequence Data
  • Multigene Family / genetics
  • Proteins / classification
  • Proteins / genetics*
  • Proteoglycans / genetics
  • Protozoan Proteins / genetics
  • Repetitive Sequences, Amino Acid
  • Sequence Homology, Amino Acid

Substances

  • Arabidopsis Proteins
  • Bacterial Proteins
  • Caenorhabditis elegans Proteins
  • Leucine-Rich Repeat Proteins
  • Membrane Proteins
  • Ppg1 protein, Leishmania
  • Proteins
  • Proteoglycans
  • Protozoan Proteins