Abstract

The current gold-standard method for cancer safety assessment of drugs is a rodent two-year bioassay, which is associated with significant costs and requires testing a high number of animals over lifetime. Due to the absence of a comprehensive set of short-term assays predicting carcinogenicity, new approaches are currently being evaluated. One promising approach is toxicogenomics, which by virtue of genome-wide molecular profiling after compound treatment can lead to an increased mechanistic understanding, and potentially allow for the prediction of a carcinogenic potential via mathematical modeling. The latter typically involves the extraction of informative genes from omics datasets, which can be used to construct generalizable models allowing for the early classification of compounds with unknown carcinogenic potential. Here we formally describe and compare two novel methodologies for the reproducible extraction of characteristic mRNA signatures, which were employed to capture specific gene expression changes observed for nongenotoxic carcinogens. While the first method integrates multiple gene rankings, generated by diverse algorithms applied to data from different subsamplings of the training compounds, the second approach employs a statistical ratio for the identification of informative genes. Both methods were evaluated on a dataset obtained from the toxicogenomics database TG-GATEs to predict the outcome of a two-year bioassay based on profiles from 14-day treatments. Additionally, we applied our methods to datasets from previous studies and showed that the derived prediction models are on average more accurate than those built from the original signatures. The selected genes were mostly related to p53 signaling and to specific changes in anabolic processes or energy metabolism, which are typically observed in tumor cells. Among the genes most frequently incorporated into prediction models were Phlda3, Cdkn1a, Akr7a3, Ccng1 and Abcb4.

Highlights

  • A crucial step in the development of drug candidates is the early exclusion of compounds with carcinogenic potential

  • In this study we introduce and compare two novel feature selection methods which were designed for the problem of extracting informative genes for nongenotoxic carcinogens (NGC)/NC classification from gene expression data (Figure 1)

  • For the Ensemble Feature Selection (EFS) method, we evaluated the performance achieved with different ensembles of feature selection techniques and assessed the stability of the selected informative genes for each gene selection method individually

Read more

Summary

Introduction

A crucial step in the development of drug candidates is the early exclusion of compounds with carcinogenic potential. In addition to high costs and sizeable animal use, this method is known to give false-positive results with respect to human relevance [1] These may for instance arise from spontaneously formed tumors, from rodent-specific modes of carcinogenicity which do not exist in humans, or from toxic doses causing cell injury or death followed by regenerative cell proliferation [2]. For the detection of genotoxicity, a cost-effective short-term test battery, consisting of the Ames test, mouse lymphoma assay, in vitro micronucleus or chromosomal aberration test, is routinely performed in the pharmaceutical industry. This battery of tests is associated with a high number of false positives [4]. There is a great demand for the development of reliable prediction methods, and toxicogenomics may be one method which is worth considering in this respect

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.