Similarity analysis of protein sequences based on the normalized relative-entropy

Comb Chem High Throughput Screen. 2008 Jul;11(6):477-81. doi: 10.2174/138620708784911500.

Abstract

Based on the classification of 20 amino acids, we reduce a protein primary sequence to six (0,1) sequences. For each of them, two so-called normalized relative-entropies are calculated and thus a 12-D vector is constructed to describe the protein primary sequence. The examination of similarities/dissimilarities among eight different proteins illustrates the utility of the approach.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Entropy*
  • Humans
  • Proteins / analysis*
  • Proteins / chemistry*
  • Proteins / classification
  • Sequence Analysis, Protein / methods*

Substances

  • Proteins