RFAmyloid: A Web Server for Predicting Amyloid Proteins

Int J Mol Sci. 2018 Jul 16;19(7):2071. doi: 10.3390/ijms19072071.

Abstract

Amyloid is an insoluble fibrous protein and its mis-aggregation can lead to some diseases, such as Alzheimer's disease and Creutzfeldt⁻Jakob's disease. Therefore, the identification of amyloid is essential for the discovery and understanding of disease. We established a novel predictor called RFAmy based on random forest to identify amyloid, and it employed SVMProt 188-D feature extraction method based on protein composition and physicochemical properties and pse-in-one feature extraction method based on amino acid composition, autocorrelation pseudo acid composition, profile-based features and predicted structures features. In the ten-fold cross-validation test, RFAmy's overall accuracy was 89.19% and F-measure was 0.891. Results were obtained by comparison experiments with other feature, classifiers, and existing methods. This shows the effectiveness of RFAmy in predicting amyloid protein. The RFAmy proposed in this paper can be accessed through the URL http://server.malab.cn/RFAmyloid/.

Keywords: RFAmy; amyloid protein; machine learning; protein classification; random forest.

MeSH terms

  • Algorithms*
  • Amyloidogenic Proteins / analysis*
  • Computational Biology / methods*
  • Databases, Protein
  • Internet
  • Reproducibility of Results
  • Sequence Analysis, Protein
  • Support Vector Machine*

Substances

  • Amyloidogenic Proteins