Abstract

The light chain (AL) amyloidosis is caused by the aggregation of light chain of antibodies into amyloid fibrils. There are plenty of computational resources available for the prediction of short aggregation-prone regions within proteins. However, it is still a challenging task to predict the amyloidogenic nature of the whole protein using sequence/structure information. In the case of antibody light chains, common architecture and known binding sites can provide vital information for the prediction of amyloidogenicity at physiological conditions. Here, in this work, we have compared classical sequence-based, aggregation-related features (such as hydrophobicity, presence of gatekeeper residues, disorderness, β-propensity, etc.) calculated for the CDR, FR or VL regions of amyloidogenic and non-amyloidogenic antibody light chains and implemented the insights gained in a machine learning-based webserver called “VLAmY-Pred” (https://web.iitm.ac.in/bioinfo2/vlamy-pred/). The model shows prediction accuracy of 79.7% (sensitivity: 78.7% and specificity: 79.9%) with a ROC value of 0.88 on a dataset of 1828 variable region sequences of the antibody light chains. This model will be helpful towards improved prognosis for patients that may likely suffer from diseases caused by light chain amyloidosis, understanding origins of aggregation in antibody-based biotherapeutics, large-scale in-silico analysis of antibody sequences generated by next generation sequencing, and finally towards rational engineering of aggregation resistant antibodies.

Highlights

  • The light chain (AL) amyloidosis is caused by the aggregation of light chain of antibodies into amyloid fibrils

  • The key observation in the aggregation capability analysis due to common architecture of antibodies includes (i) the hydrophobicity of the complementarity determining regions (CDRs) region in amyloidogenic light chains is higher, (ii) the percentage of gatekeeper residues is higher in framework regions (FRs) region of non-amyloidogenic light chains. (iii) The disorderness in variable region ­(VL) is higher for amyloidogenic light chains

  • The sequence conservation analysis showed that the amyloidogenic light chain dataset in kappa (κ) had relatively higher sequence conservation, potentially, to maintain the amyloidogenicity

Read more

Summary

Introduction

The light chain (AL) amyloidosis is caused by the aggregation of light chain of antibodies into amyloid fibrils. The model shows prediction accuracy of 79.7% (sensitivity: 78.7% and specificity: 79.9%) with a ROC value of 0.88 on a dataset of 1828 variable region sequences of the antibody light chains. This model will be helpful towards improved prognosis for patients that may likely suffer from diseases caused by light chain amyloidosis, understanding origins of aggregation in antibody-based biotherapeutics, largescale in-silico analysis of antibody sequences generated by generation sequencing, and towards rational engineering of aggregation resistant antibodies. To predict the solubility and identify the aggregation hotspots within amyloid-forming proteins These algorithms have utilized sequence and structure-based properties such as patterns of hydrophobic and polar residues, β-strand propensity, charge, ability to form cross-β motif, aggregation propensity scales determined from experimental data, solvent-exposed hydrophobic patches on molecular surface and so on. Liaw et al.[28] proposed a method using Random Forests classifier with dipeptide composition, which discriminated amyloidogenic and non-amyloidogenic antibody light chains

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call