A systematic exploration of ΔΔG cutoff ranges in machine learning models for protein mutation stability prediction.

Richard Olney, Aaron Tuor, Brian Hutchinson, Filip Jagodzinski

doi:10.1142/s021972001840022x

Abstract

Discerning how a mutation affects the stability of a protein is central to the study of a wide range of diseases. Mutagenesis experiments on physical proteins provide precise insights into the effects of amino acid substitutions, but such studies are time- and cost-prohibitive. Computational approaches for informing experimentalists where to allocate wet-lab resources are available, including a variety of machine learning models. Assessing the accuracy of machine learning models for predicting the effects of mutations depends on experiments for amino acid substitutions performed in vitro. When similar experiments on physical proteins have been performed by multiple laboratories, the use of the data near the juncture of stabilizing and destabilizing mutations is questionable. In this work, we explore a systematic and principled alternative to discarding experimental data close to that juncture. We model the inconclusive range of experimental ΔΔG values via 3- and 5-way classifiers, and systematically explore potential boundaries for the range of inconclusive experimental values. We demonstrate the effectiveness of potential boundaries through confusion matrices and heat map visualizations. We explore two novel metrics for assessing viable cutoff ranges, and find that under these metrics, a lower cutoff near [Formula: see text] and an upper cutoff near [Formula: see text] are optimal across multiple machine learning models.
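
To make the 3-way setup concrete, the following is a minimal sketch of how a cutoff range can turn continuous values into classes and how a confusion matrix can then be tallied. It is not the authors' implementation. The function names, the sign convention (values below the lower cutoff treated as destabilizing), the cutoffs of -0.5 and 0.5, and all data values are assumptions made up for illustration; the abstract's actual cutoffs are not reproduced here.

```python
from collections import Counter

# Class order used for both rows (true) and columns (predicted).
LABELS = ("destabilizing", "inconclusive", "stabilizing")


def label_ddg(ddg, lower, upper):
    """Map one value to a 3-way class using a [lower, upper] inconclusive range.

    Assumed convention (not taken from the paper): values below `lower`
    are destabilizing and values above `upper` are stabilizing.
    """
    if lower > upper:
        raise ValueError("lower cutoff must not exceed upper cutoff")
    if ddg < lower:
        return "destabilizing"
    if ddg > upper:
        return "stabilizing"
    return "inconclusive"


def confusion_matrix(true_labels, predicted_labels):
    """Return a 3x3 list of counts: rows are true classes, columns predicted."""
    counts = Counter(zip(true_labels, predicted_labels))
    return [[counts[(t, p)] for p in LABELS] for t in LABELS]


# Made-up example values, for illustration only.
experimental = [-2.1, -0.3, 0.1, 0.8, 1.9]
predicted = [-1.7, 0.2, -0.6, 0.4, 2.5]

# Hypothetical cutoff pair; a systematic exploration would loop over many pairs.
lower, upper = -0.5, 0.5
true_classes = [label_ddg(v, lower, upper) for v in experimental]
pred_classes = [label_ddg(v, lower, upper) for v in predicted]
matrix = confusion_matrix(true_classes, pred_classes)

for name, row in zip(LABELS, matrix):
    print(f"{name:>13}: {row}")
```

Repeating the last step over a grid of candidate (lower, upper) pairs yields one confusion matrix per pair, which is the kind of output a heat map over cutoff ranges could summarize.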

Journal: Journal of Bioinformatics and Computational Biology
Publication Date: Oct 1, 2018
Citations: 1
