Integrated Cheminformatics and Bioinformatics

J Elands

doi:10.1016/s1535-5535(04)00142-x

Abstract

For every experimental compound in the drug discovery pipeline, pharmaceutical companies generate huge amounts of data: chemical and physical data derived from various analytical techniques, and biological data that flow out of large-scale screening programs as well as lead optimization work. Drug discovery efforts have been hampered not by a lack of lead compounds or a dearth of experimental data, but by the need for effective and efficient computational tools to collect, store, manipulate, and analyze large amounts of data. Scientists in many of the major pharmaceutical and biotechnology companies, including GlaxoSmithKline, Aventis, AstraZeneca, Hoffmann-La Roche, Merck, Novartis, Millennium, Exelixis, and Immunex, Cytokinetics, Evotec and Monsanto are using ActivityBase (Figure 1), an integrated data management system, to collect and analyze data generated by high throughput screening (HTS), to store chemical structures and register novel compounds, and to integrate cheminformatic and bioinformatic datasets. ActivityBase manages data produced by HTS (with sustained screening volumes exceeding 30,000 to 40,000 wells/working day) and ultra-HTS (sustained volumes of greater than 100,000 wells/working day) and some companies are populating ActivityBase databases at the rate of approximately 20 million data points per six months. Many operational databases exceed tens of millions of rows, and the software’s search engine can respond to typical queries in a matter of seconds. An abundance of data does not necessarily add value to an experimental compound. The data do not imply therapeutic efficacy, infer bioavailability, predict toxicity, or suggest drug-like properties. Successful discovery research depends on the ability to integrate diverse datasets from multiple sources and to extract information from raw data. It is this information that will guide and expedite decision-making, improve productivity, and add value. It is this information that will allow a company to decide whether to pursue a lead compound or to “fail” it early in the discovery process. ActivityBase is based on IDBS’ generic data model designed for discovery research and can capture, manage, and store data from biological, chemical, and robotic systems. The ActivityBase 5.0 Suite seamlessly integrates cheminformatic and bioinformatic data. It provides the framework for converting data into information that can be applied to lead discovery and optimization processes. (Figure 2) New functionalities introduced in version 5.0 enhance the flexibility of data collection and analysis and expand data integration capabilities. Joining AssayBase, which manages biological data are three new software modules: · StructureBase for registering chemical compounds and searching molecular structures and related physicochemical data. (Figure 3) · ReactionBase for storing, managing, and searching chemical reactions and reaction schemes. (Figure 4) · Natural Products for managing the process of isolating active compounds from natural materials; it generates a genealogic trail that tracks the derivation of new chemical compounds from natural products.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Integrated Cheminformatics and Bioinformatics

Abstract

Talk to us

Similar Papers

More From: Journal of the Association for Laboratory Automation

Lead the way for us

Similar Papers

The case for open‐access chemical biology
Johan Weigelt
EMBO reports | VOL. 10
Johan WeigeltJohan Weigelt
21 Aug 2009
EMBO reports | VOL. 10

Abstract 5774: Novel cell-based high-throughput hybridoma screening method using the Celigo image cytometer for antibody discovery
Leo L Chan ... Simon C Robson
Cancer Research | VOL. 78
Leo L Chan, et. al.Leo L Chan ... Simon C Robson
01 Jul 2018
Cancer Research | VOL. 78

Metabolic stability for drug discovery and development: pharmacokinetic and biochemical challenges.
Collen M Masimirembwa ... Ulf Bredberg
Clinical Pharmacokinetics | VOL. 42
Collen M Masimirembwa, et. al.Collen M Masimirembwa ... Ulf Bredberg
01 Jan 2003
Clinical Pharmacokinetics | VOL. 42

AML-377 A “Designed” High-Throughput Drug Screening Strategy Identifies Aurora Kinase A Inhibitors as Promising Preclinical Candidates for the Treatment of NPM1-Mutated AML
Roberta Ranieri ... Serenella Silvestri
Clinical Lymphoma Myeloma and Leukemia | VOL. 22
Roberta Ranieri, et. al.Roberta Ranieri ... Serenella Silvestri
01 Oct 2022
Clinical Lymphoma Myeloma and Leukemia | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integrated Cheminformatics and Bioinformatics

Abstract

Talk to us

Similar Papers

More From: Journal of the Association for Laboratory Automation