Data Analysis System Research Articles

Abstract Pancreatic cancer is the third leading cause of cancer-related death in the United States. Current therapeutic options offer a dismal overall survival with the 5-year survival at just ~12%. Analysis of the clinical and molecular underpinnings of pancreatic cancer is critical to developing both early detection methodologies as well as novel therapeutic options. The aggressiveness and deadly nature of this disease warranted the development of a data repository and analysis system dedicated to pancreatic cancer data. The Pancreatic Cancer Action Network’s (PanCAN) SPARK platform is a cloud-based data and analytics platform, powered by Velsera, that integrates real-world patient health data from PanCAN research initiatives and accelerates research by making pancreatic cancer data easier to access and use. Encompassing clinical, imaging, and genomics data from over 600 patients with pancreatic cancer within PanCAN’s Know Your Tumor ® (KYT) precision medicine service, the SPARK platform connects with petabytes of publicly available cancer data via the Cancer Genomics Cloud (CGC), also powered by Velsera. The CGC is part of NCI’s Cancer Research Data Commons (CRDC), a cloud-based data science infrastructure that connects data with analytics tools to allow researchers to share, integrate, analyze, visualize, and drive scientific discovery. Here, we demonstrate the application of these datasets by providing a case study demonstrating how to combine and enrich data to accelerate pancreatic cancer research. Currently, the genomic and proteomic data available on CRDC amounts to 402 and 304 cases of pancreatic tumor samples, respectively. We will use the capabilities of the SPARK and CGC platforms, which provide ready-to-use tools for multi-omics analysis that require no coding knowledge. Using the KYT and CRDC open-access pancreatic cancer data, we aim to demonstrate how to perform integrated analysis of data from diverse scientific domains, and share with collaborators all in one space, streamlining and increasing the potential for new scientific discoveries. Further expansion of the PanCAN and CGC datasets will undoubtedly provide a more comprehensive understanding of pancreatic cancer tumor biology. SPARK and CGC’s cloud based computation infrastructure, along with numerous available cancer datasets and easy-to-use multi-omics data processing workflows and data analytic tools will be instrumental in this process. Citation Format: Kawther Abdilleh, Zelia Worman, Rowan Beck, Cera Fisher, Divya Sain, Jack DiGiovanna, Lynn Matrisian, Sudheer Doss, Brandi Davis-Dusenbery. Leveraging real-world pancreatic cancer datasets to drive drug discovery and patient health [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular Abstracts); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl):Abstract nr 6475.

Read full abstract

The determination of metabolic stability is critical for drug discovery programs, allowing for the optimization of chemical entities and compound prioritization. As such, it is common to perform high-volume in vitro metabolic stability experiments early in the lead optimization process to understand metabolic liabilities. Additional metabolite identification experiments are subsequently performed for a more comprehensive understanding of the metabolic clearance routes to aid medicinal chemists in the structural design of compounds. Collectively, these experiments require extensive sample preparation and a substantial amount of time and resources. To overcome the challenges, a high-throughput integrated assay for simultaneous hepatocyte metabolic stability assessment and metabolite profiling was developed. This assay platform consists of four parts: 1) an automated liquid-handling system for sample preparation and incubation, 2) a liquid chromatography and high-resolution mass spectrometry-based system to simultaneously monitor the parent compound depletion and metabolite formation, 3) an automated data analysis and report system for hepatic clearance assessment; and 4) streamlined autobatch processing for software-based metabolite profiling. The assay platform was evaluated using eight control compounds with various metabolic rates and biotransformation routes in hepatocytes across three species. Multiple sample preparation and data analysis steps were evaluated and validated for accuracy, repeatability, and metabolite coverage. The combined utility of an automated liquid-handling instrument, a high-resolution mass spectrometer, and multiple streamlined data processing software improves the process of these highly demanding screening assays and allows for simultaneous determination of metabolic stability and metabolite profiles for more efficient lead optimization during early drug discovery. SIGNIFICANCE STATEMENT: Metabolic stability assessment and metabolite profiling are pivotal in drug discovery to fully comprehend metabolic liabilities for chemical entity optimization and lead selection. Process of these assays can be repetitive and resource demanding. Here, we developed an integrated hepatocyte stability assay that combines automation, high-resolution mass spectrometers, and batch-processing software to improve and combine the workflow of these assays. The integrated approach allows simultaneous metabolic stability assessment and metabolite profiling, significantly accelerating screening and lead optimization in a resource-effective manner.

Read full abstract

Data Analysis System Research Articles

Related Topics

Articles published on Data Analysis System

Retraction Note: Intelligent Crime Prevention and Control Big Data Analysis System Based on Imaging and Capsule Network Model

Performance of JT-60SA Thomson Scattering data analysis system

Mortality associated with polymyalgia rheumatica in the United States in the 1999-2020 period: a multiple-cause-of-death study.

Ambulatory Long Block: A Model of Precision Education and Assessment for Internal Medicine Residents.

Determination of Gain Scheduling Parameters for Loss of a Feedwater Pump Transient Mitigation Using Neural Networks

Ocean Temperature Profiling Lidar: Analysis of Technology and Potential for Rapid Ocean Observations

Machine Learning Approaches for In-Vehicle Failure Prognosis in Automobiles: A Review

System of complex data analysis of thematic sites ISCAD IS

Analyzing the role of big data and its effects on the retail industry

RABiT-III: an Automated Micronucleus Assay at a Non-Specialized Biodosimetry Facility.

Abstract 6475: Leveraging real-world pancreatic cancer datasets to drive drug discovery and patient health

Design and implementation of a Li River water quality monitoring and analysis system based on outlier data analysis.

Improving simulations of extreme precipitation events in China by the CMIP6 global climate models through statistical downscaling

A Model for Predicting Physical Health of College Students Based on Semantic Web and Deep Learning Under Cloud Edge Collaborative Architecture

An Integrated Hepatocyte Stability Assay for Simultaneous Metabolic Stability Assessment and Metabolite Profiling.

A non-invasive, low-cost and easy-to-operate underwater terrain monitoring method

АРХІТЕКТУРА ПРОГРАМНОЇ СИСТЕМИ ДЛЯ ВИРІШЕННЯ ЗАДАЧІ КЛАСИФІКАЦІЇ НА ОСНОВІ ПРИВАТНИХ ДАНИХ

Conversational agent in HCI a review

Development of regional pharmacy intravenous admixture services data reporting and analysis platform for enhanced quality control ability

Real-time sensor networks based on genetic algorithms application in the analysis of innovative data in cultural industry management

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Data Analysis System Research Articles

Related Topics

Articles published on Data Analysis System

Retraction Note: Intelligent Crime Prevention and Control Big Data Analysis System Based on Imaging and Capsule Network Model

Performance of JT-60SA Thomson Scattering data analysis system

Mortality associated with polymyalgia rheumatica in the United States in the 1999-2020 period: a multiple-cause-of-death study.

Ambulatory Long Block: A Model of Precision Education and Assessment for Internal Medicine Residents.

Determination of Gain Scheduling Parameters for Loss of a Feedwater Pump Transient Mitigation Using Neural Networks

Ocean Temperature Profiling Lidar: Analysis of Technology and Potential for Rapid Ocean Observations

Machine Learning Approaches for In-Vehicle Failure Prognosis in Automobiles: A Review

System of complex data analysis of thematic sites ISCAD IS

Analyzing the role of big data and its effects on the retail industry

RABiT-III: an Automated Micronucleus Assay at a Non-Specialized Biodosimetry Facility.

Abstract 6475: Leveraging real-world pancreatic cancer datasets to drive drug discovery and patient health

Design and implementation of a Li River water quality monitoring and analysis system based on outlier data analysis.

Improving simulations of extreme precipitation events in China by the CMIP6 global climate models through statistical downscaling

A Model for Predicting Physical Health of College Students Based on Semantic Web and Deep Learning Under Cloud Edge Collaborative Architecture

An Integrated Hepatocyte Stability Assay for Simultaneous Metabolic Stability Assessment and Metabolite Profiling.

A non-invasive, low-cost and easy-to-operate underwater terrain monitoring method

АРХІТЕКТУРА ПРОГРАМНОЇ СИСТЕМИ ДЛЯ ВИРІШЕННЯ ЗАДАЧІ КЛАСИФІКАЦІЇ НА ОСНОВІ ПРИВАТНИХ ДАНИХ

Conversational agent in HCI a review

Development of regional pharmacy intravenous admixture services data reporting and analysis platform for enhanced quality control ability

Real-time sensor networks based on genetic algorithms application in the analysis of innovative data in cultural industry management