Bioinformatic identification of previously unrecognized amyloidogenic proteins

J Biol Chem. 2022 May;298(5):101920. doi: 10.1016/j.jbc.2022.101920. Epub 2022 Apr 9.

Abstract

Low-complexity domains (LCDs) of proteins have been shown to self-associate, and pathogenic mutations within these domains often drive the proteins into amyloid aggregation associated with disease. These domains may be especially susceptible to amyloidogenic mutations because they are commonly intrinsically disordered and function in self-association. The question therefore arises whether a search for pathogenic mutations in LCDs of the human proteome can lead to identification of other proteins associated with amyloid disease. Here, we take a computational approach to identify documented pathogenic mutations within LCDs that may favor amyloid formation. Using this approach, we identify numerous known amyloidogenic mutations, including several such mutations within proteins previously unidentified as amyloidogenic. Among the latter group, we focus on two mutations within the TRK-fused gene protein (TFG), known to play roles in protein secretion and innate immunity, which are associated with two different peripheral neuropathies. We show that both mutations increase the propensity of TFG to form amyloid fibrils. We therefore conclude that TFG is a novel amyloid protein and propose that the diseases associated with its mutant forms may be amyloidoses.

Keywords: Charcot–Marie–Tooth disease electron microscopy; amyloid; intrinsically disordered protein; low-complexity domain; protein structure.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Amyloid / genetics
  • Amyloid / metabolism
  • Amyloidogenic Proteins* / genetics
  • Amyloidosis* / metabolism
  • Amyloidosis* / pathology
  • Computational Biology*
  • Humans
  • Mutation
  • Proteome / genetics

Substances

  • Amyloid
  • Amyloidogenic Proteins
  • Proteome