High dimensional surrogacy: computational aspects of an upscaled analysis

Rudradev Sengupta,Nolen Joy Perualila,Ziv Shkedy,Przemyslaw Biecek,Geert Molenberghs,Luc Bijnens

doi:10.1080/10543406.2019.1657128

Abstract

ABSTRACTIdentification of genomic biomarkers is an important area of research in the context of drug discovery experiments. These experiments typically consist of several high dimensional datasets that contain information about a set of drugs (compounds) under development. This type of data structure introduces the challenge of multi-source data integration. High-Performance Computing (HPC) has become an important tool for everyday research tasks. In the context of drug discovery, high dimensional multi-source data needs to be analyzed to identify the biological pathways related to the new set of drugs under development. In order to process all information contained in the datasets, HPC techniques are required. Even though R packages for parallel computing are available, they are not optimized for a specific setting and data structure. In this article, we propose a new framework, for data analysis, to use R in a computer cluster. The proposed data analysis workflow is applied to a multi-source high dimensional drug discovery dataset and compared with a few existing R packages for parallel computing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

High dimensional surrogacy: computational aspects of an upscaled analysis

Abstract

Talk to us

Similar Papers

More From: Journal of Biopharmaceutical Statistics

Lead the way for us

Similar Papers

Multivariate Procedure for Variable Selection and Classification of High Dimensional Heterogeneous Data
Tahir Mehmood ... Zahid Rasheed
Communications for Statistical Applications and Methods | VOL. 22
Tahir Mehmood, et. al.Tahir Mehmood ... Zahid Rasheed
30 Nov 2015
Communications for Statistical Applications and Methods | VOL. 22

H-D and Subspace Clustering of Paradoxical High Dimensional Clinical Datasets with Dimension Reduction Techniques – a Model
S Rajeswari ... M S Josephine
Indian Journal of Science and Technology | VOL. 9
S Rajeswari, et. al.S Rajeswari ... M S Josephine
19 Oct 2016
Indian Journal of Science and Technology | VOL. 9

Features Selection in Statistical Classification of High Dimensional Image Derived Maize (<i>Zea Mays</i> L.) Phenomic Data
Peter Gachoki ... Gladys Njoroge
American Journal of Applied Mathematics and Statistics | VOL. 10
Peter Gachoki, et. al.Peter Gachoki ... Gladys Njoroge
07 Jun 2022
American Journal of Applied Mathematics and Statistics | VOL. 10

Statistical analysis of high-dimensional biomedical data: a gentle introduction to analytical goals, common approaches and challenges
Jörg Rahnenführer ... Eugenia Migliavacca
BMC Medicine | VOL. 21
Jörg Rahnenführer, et. al.Jörg Rahnenführer ... Eugenia Migliavacca
15 May 2023
BMC Medicine | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

High dimensional surrogacy: computational aspects of an upscaled analysis

Abstract

Talk to us

Similar Papers

More From: Journal of Biopharmaceutical Statistics