Abstract

Spider dragline silk is known for its exceptional strength and toughness; hence understanding the link between its primary sequence and mechanics is crucial. Here, we establish a deep-learning framework to clarify this link in dragline silk. The method utilizes sequence and mechanical property data of dragline spider silk as well as enriching descriptors such as residue-level mobility (B-factor) predictions. Our sequence representation captures the relative position, repetitiveness, as well as descriptors of amino acids that serve to physically enrich the model. We obtain high Pearson correlation coefficients (0.76–0.88) for strength, toughness, and other properties, which show that our B-factor based representation outperforms pure sequence-based models or models that use other descriptors. We prove the utility of our framework by identifying influential motifs and demonstrating how the B-factor serves to pinpoint potential mutations that improve strength and toughness, thereby establishing a validated, predictive, and interpretable sequence model for designing tailored biomaterials.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call