Development Of Quantitative Structure-activity Relationship Models Research Articles

IntroductionThe current effort towards the digital transformation across multiple scientific domains requires data that is Findable, Accessible, Interoperable and Reusable (FAIR). In addition to the FAIR data, what is required for the application of computational tools, such as Quantitative Structure Activity Relationships (QSARs), is a sufficient data volume and the ability to merge sources into homogeneous digital assets. In the nanosafety domain there is a lack of FAIR available metadata. MethodologyTo address this challenge, we utilized 34 datasets from the nanosafety domain by exploiting the NanoSafety Data Reusability Assessment (NSDRA) framework, which allowed the annotation and assessment of dataset's reusability. From the framework's application results, eight datasets targeting the same endpoint (i.e. numerical cellular viability) were selected, processed and merged to test several hypothesis including universal versus nanogroup-specific QSAR models (metal oxide and nanotubes), and regression versus classification Machine Learning (ML) algorithms. ResultsUniversal regression and classification QSARs reached an 0.86 R2 and 0.92 accuracy, respectively, for the test set. Nanogroup-specific regression models reached 0.88 R2 for nanotubes test set followed by metal oxide (0.78). Nanogroup-specific classification models reached 0.99 accuracy for nanotubes test set, followed by metal oxide (0.91). Feature importance revealed different patterns depending on the dataset with common influential features including core size, exposure conditions and toxicological assay.Even in the case where the available experimental knowledge was merged, the models still failed to correctly predict the outputs of an unseen dataset, revealing the cumbersome conundrum of scientific reproducibility in realistic applications of QSAR for nanosafety. To harness the full potential of computational tools and ensure their long-term applications, embracing FAIR data practices is imperative in driving the development of responsible QSAR models. ConclusionsThis study reveals that the digitalization of nanosafety knowledge in a reproducible manner has a long way towards its successful pragmatic implementation. The workflow carried out in the study shows a promising approach to increase the FAIRness across all the elements of computational studies, from dataset's annotation, selection, merging to FAIR modeling reporting. This has significant implications for future research as it provides an example of how to utilize and report different tools available in the nanosafety knowledge system, while increasing the transparency of the results. One of the main benefits of this workflow is that it promotes data sharing and reuse, which is essential for advancing scientific knowledge by making data and metadata FAIR compliant. In addition, the increased transparency and reproducibility of the results can enhance the trustworthiness of the computational findings.

Quantitative structure-activity relationship (QSAR) and read-across techniques have recently been merged into a new emerging field of read-across structure-activity relationship (RASAR) that uses the chemical similarity concepts of read-across (an unsupervised step) and finally develops a supervised learning model (like QSAR). The RASAR method has so far been used only in case of graded predictions or classification modeling. In this work, we attempt, for the first time, to apply RASAR for quantitative predictions (q-RASAR) using a case study of androgen receptor binding affinity data. We have computed a number of error-based and similarity-based measures such as weighted standard deviation of the predicted values, coefficient of variation of the computed predictions, average similarity level of close training compounds for each query molecule, standard deviation and coefficient of variation of similarity levels, maximum similarity levels to positive and negative close training compounds, a concordance measure indicating similarity to positive, negative or both classes of close training compounds, etc. We have clubbed these additional measures along with the selected chemical descriptors from the previously developed QSAR model and redeveloped new partial least squares models from the training set, and predicted the endpoint using the query data set. Interestingly, these new models outperform the internal and external validation quality of the original QSAR model. In this study, we have also introduced a new similarity-based concordance measure (Banerjee-Roy coefficient) that can significantly contribute to the model quality. A q-RASAR model also has the advantage over read-across predictions in providing easy interpretation and indicating quantitative contributions of important chemical features. The strategy described here should be applicable to other biological/toxicological/property data modeling for enhanced quality of predictions, easy interpretability, and efficient transferability.

Development Of Quantitative Structure-activity Relationship Models Research Articles

Related Topics

Articles published on Development Of Quantitative Structure-activity Relationship Models

A data reusability assessment in the nanosafety domain based on the NSDRA framework followed by an exploratory quantitative structure activity relationships (QSAR) modeling targeting cellular viability

Machine learning-driven QSAR models for predicting the mixture toxicity of nanoparticles

MicotoXilico: An Interactive Database to Predict Mutagenicity, Genotoxicity, and Carcinogenicity of Mycotoxins.

Application of QSAR Approach to Assess the Effects of Organic Pollutants on Bacterial Virulence Factors.

2D-QSAR study and design of novel pyrazole derivatives as an anticancer lead compound against A-549, MCF-7, HeLa, HepG-2, PaCa-2, DLD-1

Quantitative structure–activity relationship models for predicting apparent rate constants of organic compounds with ferrate (VI)

Development of predictive QSAR models for the substrates/inhibitors of OATP1B1 by deep neural networks

How the Structure of Per- and Polyfluoroalkyl Substances (PFAS) Influences Their Binding Potency to the Peroxisome Proliferator-Activated and Thyroid Hormone Receptors-An In Silico Screening Study.

Predicting Endocrine Disruption Using Conformal Prediction - APrioritization Strategy to Identify Hazardous Chemicals with Confidence.

HDAC1 PREDICTOR: a simple and transparent application for virtual screening of histone deacetylase 1 inhibitors

Development of QSAR models to predict blood-brain barrier permeability.

Predictive modeling of antibacterial activity of ionic liquids by machine learning methods

Application of QSAR for investigation on coagulation mechanisms of textile wastewater.

Development of new QSAR models for water, sediment, and soil half-life.

Development of QSAR model using machine learning and molecular docking study of polyphenol derivatives against obesity as pancreatic lipase inhibitor

Development of QSAR Model of Caffeic Acid Phenethyl Ester as Anti-Cancer HT-29

Prediction of aquatic toxicity of energetic materials using genetic function approximation

First report of q-RASAR modeling toward an approach of easy interpretability and efficient transferability.

Chemometric modeling of acute toxicity of diverse aromatic compounds against Rana japonica

Unveil the quantum chemical descriptors determining direct photodegradation of antibiotics under simulated sunlight: Batch experiments and model development

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Development Of Quantitative Structure-activity Relationship Models Research Articles

Related Topics

Articles published on Development Of Quantitative Structure-activity Relationship Models

A data reusability assessment in the nanosafety domain based on the NSDRA framework followed by an exploratory quantitative structure activity relationships (QSAR) modeling targeting cellular viability

Machine learning-driven QSAR models for predicting the mixture toxicity of nanoparticles

MicotoXilico: An Interactive Database to Predict Mutagenicity, Genotoxicity, and Carcinogenicity of Mycotoxins.

Application of QSAR Approach to Assess the Effects of Organic Pollutants on Bacterial Virulence Factors.

2D-QSAR study and design of novel pyrazole derivatives as an anticancer lead compound against A-549, MCF-7, HeLa, HepG-2, PaCa-2, DLD-1

Quantitative structure–activity relationship models for predicting apparent rate constants of organic compounds with ferrate (VI)

Development of predictive QSAR models for the substrates/inhibitors of OATP1B1 by deep neural networks

How the Structure of Per- and Polyfluoroalkyl Substances (PFAS) Influences Their Binding Potency to the Peroxisome Proliferator-Activated and Thyroid Hormone Receptors-An In Silico Screening Study.

Predicting Endocrine Disruption Using Conformal Prediction - APrioritization Strategy to Identify Hazardous Chemicals with Confidence.

HDAC1 PREDICTOR: a simple and transparent application for virtual screening of histone deacetylase 1 inhibitors

Development of QSAR models to predict blood-brain barrier permeability.

Predictive modeling of antibacterial activity of ionic liquids by machine learning methods

Application of QSAR for investigation on coagulation mechanisms of textile wastewater.

Development of new QSAR models for water, sediment, and soil half-life.

Development of QSAR model using machine learning and molecular docking study of polyphenol derivatives against obesity as pancreatic lipase inhibitor

Development of QSAR Model of Caffeic Acid Phenethyl Ester as Anti-Cancer HT-29

Prediction of aquatic toxicity of energetic materials using genetic function approximation

First report of q-RASAR modeling toward an approach of easy interpretability and efficient transferability.

Chemometric modeling of acute toxicity of diverse aromatic compounds against Rana japonica

Unveil the quantum chemical descriptors determining direct photodegradation of antibiotics under simulated sunlight: Batch experiments and model development