Abstract

Monoclonal antibodies (mAbs) constitute a rapidly growing biopharmaceutical sector. However, their growth is impeded by developability issues such as polyspecificity and lack of solubility, which leads to attrition as well as manufacturing failures. In this study a multitool hybrid quantitative structure-activity relationship (QSAR) model development framework is described. This framework uses four novel datasets derived from the primary sequences of IgG1-κ-humanized mAbs with varying degrees of resolutions. Unsupervised pattern recognition is first performed on the descriptor sets to visualize any intrinsic property-based clustering, followed by regression of descriptors against cross-interaction chromatography (CIC) retention times. Model optimization is performed via unsupervised variable reduction followed by supervised variable selection. Finally, the models and datasets are benchmarked based on the regression model performance metrics such as R2 , Q2 , and RMSE. The results show that datasets containing localized descriptors rather than averaged value over the entire protein have better predictive performance of CIC retention behavior with R2 > 0.8 and RMSE < 0.3. Furthermore, the results indicate the physicochemical, electronic, and topological properties of hypervariable regions of antibodies that contribute most to the CIC retention times. The results of these studies could contribute to early-stage screening and better design of mAbs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.