Robust Co-clustering to Discover Toxicogenomic Biomarkers and Their Regulatory Doses of Chemical Compounds Using Logistic Probabilistic Hidden Variable Model.

Mohammad Nazmol Hasan, Anjuman Ara Begum, Md Masud Rana, Md Nurul Haque Mollah, Moizur Rahman

doi:10.3389/fgene.2018.00516


Open Access

https://doi.org/10.3389/fgene.2018.00516


Abstract

Detecting biomarker genes and their regulatory doses of chemical compounds (DCCs) is one of the most important tasks in toxicogenomic studies, as well as in drug design and development. The online computational platform “Toxygates” identifies biomarker genes and their regulatory DCCs by a co-clustering approach. However, its algorithm, which is based on hierarchical clustering (HC), does not share two-way gene-DCC information while co-clustering genes and DCCs, and it is sensitive to outlying observations. The platform may therefore produce misleading results in some cases. The probabilistic hidden variable model (PHVM) is a more effective co-clustering approach that shares two-way information simultaneously, but it is also sensitive to outlying observations. Because gene expression data are often contaminated by outlying observations, in this paper we propose the logistic probabilistic hidden variable model (LPHVM) for robust co-clustering of genes and DCCs. We compared the proposed LPHVM co-clustering approach with the conventional PHVM using simulated datasets and with the Toxygates co-clustering approach using real-life TGP gene expression datasets. Simulation results show that the proposed method outperforms the conventional PHVM in the presence of outliers and performs equally well otherwise. In the real-life TGP data analysis, the Toxygates online platform incorrectly co-clustered three DCCs (glibenclamide-low, perhexilline-low, and hexachlorobenzene-medium) for the glutathione metabolism pathway dataset and two DCCs (acetaminophen-medium and methapyrilene-low) for the PPAR signaling pathway dataset, whereas the proposed LPHVM approach incorrectly co-clustered only one DCC (hexachlorobenzene-low) for the glutathione metabolism pathway. Our findings from the real data analysis are also supported by other findings in the literature.
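The abstract does not give the LPHVM equations, so the following is only a minimal sketch of the general idea behind a logistic transformation: a logistic (sigmoid) function maps any real value into a bounded interval, which limits how far a single outlying expression value can sit from the rest. The function name `logistic_squash`, its `center` and `scale` parameters, and the example values are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def logistic_squash(x, center=0.0, scale=1.0):
    """Map values into [0, 1] with a logistic curve (illustrative only).

    Uses the identity 1 / (1 + exp(-z)) = 0.5 * (1 + tanh(z / 2)),
    which avoids overflow for large |z|.
    """
    z = (np.asarray(x, dtype=float) - center) / scale
    return 0.5 * (1.0 + np.tanh(z / 2.0))

# Hypothetical expression values with one gross outlier (50.0).
values = np.array([-1.0, 0.0, 1.0, 50.0])
squashed = logistic_squash(values)

# On the raw scale the outlier is 49 units above its neighbour;
# after the transformation the gap is bounded by the width of [0, 1].
raw_gap = values[3] - values[2]
squashed_gap = squashed[3] - squashed[2]
```

The ordering of the values is preserved, while the influence of the extreme value on any distance-based or likelihood-based step is capped.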

Highlights

Toxicogenomic studies combine toxicology with several omics technologies to assess the risk of toxins and chemical agents in an organism (NRC, 2007; Afshari et al., 2011).
We investigate the performance of our proposed method (LPHVM) by comparing it with the conventional probabilistic hidden variable model (PHVM) on simulated datasets D1 and D2, in the absence and presence of outlying observations. The aim is robust co-clustering of genes and doses of chemical compounds (DCCs) to discover biomarker genes and their regulatory DCCs.
In each simulation run, outliers are introduced into the dataset using two data contamination methods, the Tukey-Huber contamination model (THCM) and the independent contamination model (ICM), and the error rate (ER) is calculated for PHVM and for the logistic probabilistic hidden variable model (LPHVM) applied to the contaminated datasets.
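The two contamination schemes named in the last highlight differ in what gets contaminated: in the Tukey-Huber model whole observations (rows) are outlying, while in the independent contamination model individual cells are contaminated independently. A minimal sketch of that difference follows. The additive `shift`, the contamination rate `eps`, the matrix size, and all function names are assumptions chosen for illustration; the source does not state the settings used in the paper, and the `error_rate` helper assumes cluster labels have already been aligned with the true labels.

```python
import numpy as np

def contaminate_thcm(X, eps, rng, shift=10.0):
    """Case-wise (Tukey-Huber style) contamination: whole rows become outliers."""
    X = np.array(X, dtype=float)            # np.array copies, input stays intact
    rows = rng.random(X.shape[0]) < eps     # each row is hit with probability eps
    X[rows, :] += shift                     # every cell of a hit row is shifted
    return X, rows

def contaminate_icm(X, eps, rng, shift=10.0):
    """Cell-wise (independent) contamination: each cell is hit on its own."""
    X = np.array(X, dtype=float)
    cells = rng.random(X.shape) < eps       # each cell is hit with probability eps
    X[cells] += shift
    return X, cells

def error_rate(true_labels, predicted_labels):
    """Fraction of items assigned to the wrong cluster (labels assumed aligned)."""
    true_labels = np.asarray(true_labels)
    predicted_labels = np.asarray(predicted_labels)
    return float(np.mean(true_labels != predicted_labels))

rng = np.random.default_rng(0)
clean = rng.normal(size=(20, 6))            # hypothetical gene-by-DCC matrix
thcm_data, hit_rows = contaminate_thcm(clean, 0.1, rng)
icm_data, hit_cells = contaminate_icm(clean, 0.1, rng)
```

With the same `eps`, cell-wise contamination typically touches many more rows than case-wise contamination, because a row counts as affected as soon as any one of its cells is hit.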

Summary

Introduction

Toxicogenomic studies combine toxicology with several omics technologies (genomics, transcriptomics, proteomics, and metabolomics) to assess the risk of toxins (small molecules, peptides, or proteins) and chemical agents (drugs, gasoline, alcohol, pesticides, fuel oil, and cosmetics) in an organism (NRC, 2007; Afshari et al., 2011). By integrating these omics technologies with bioinformatics, toxicogenomics can be used to suggest the molecular mechanism of toxicity. Toxicogenomic biomarkers can be identified from extensive gene-treatment expression datasets of the target organs of individuals (Fielden et al., 2007; Uehara et al., 2008; Igarashi et al., 2015).



Journal: Frontiers in Genetics
Publication Date: Nov 1, 2018
Citations: 9
License type: CC BY 4.0

