Prediction of functional class of proteins and peptides irrespective of sequence homology by support vector machines

Zhi Qun Tang; Hong Huang Lin; Hai Lei Zhang; Lian Yi Han; Xin Chen; Yu Zong Chen

doi:10.4137/bbi.s315

Bioinform Biol Insights. 2009 Nov 24:1:19-47. doi: 10.4137/bbi.s315.

Authors

Zhi Qun Tang¹, Hong Huang Lin, Hai Lei Zhang, Lian Yi Han, Xin Chen, Yu Zong Chen

Affiliation

¹ Department of Pharmacy and Department of Computational Science, National University of Singapore, Republic of Singapore, 117543.

Abstract

Various computational methods have been used to predict protein and peptide function from sequence. A particular challenge is to derive functional properties from sequences that show low or no homology to proteins of known function. Recently, a machine learning method, support vector machines (SVM), has been explored for predicting the functional class of proteins and peptides from properties derived from the amino acid sequence, independent of sequence similarity. The method has shown promising potential for a wide spectrum of protein and peptide classes, including some low- and non-homologous proteins. It can thus be explored as a potential tool to complement alignment-based, clustering-based, and structure-based methods for predicting protein function. This article reviews the strategies, current progress, and underlying difficulties in using SVM to predict the functional class of proteins. The relevant software and web servers are described, and the reported prediction performance of these methods is presented.
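
To illustrate what a sequence-derived property independent of alignment can look like, the following minimal sketch converts a protein or peptide sequence into a fixed-length vector of amino acid fractions. The choice of amino acid composition as the descriptor, the function name `composition_vector`, and the toy sequences are assumptions made for illustration; the abstract does not specify which descriptors the reviewed methods use. Vectors of this kind could in principle be passed to any SVM implementation as input features.

```python
from collections import Counter

# The 20 standard amino acids in a fixed order, so that every
# sequence maps to a vector with the same layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_vector(sequence):
    """Return the fraction of each standard amino acid in `sequence`.

    Hypothetical helper for illustration only. Characters outside the
    20 standard residues (for example 'X' or gaps) are ignored.
    """
    seq = sequence.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


# Toy sequences (made up): different lengths, same vector length.
short_vec = composition_vector("ACDA")
long_vec = composition_vector("MKTAYIAKQRQISFVKSHFSRQ")
print(len(short_vec), len(long_vec))
```

Because the vector length does not depend on sequence length and no alignment is involved, two proteins with no detectable homology can still be compared in the same feature space, which is the property the abstract emphasizes.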

Keywords: machine learning method; peptide function; protein family; protein function; protein function prediction; support vector machines.