IPDS: A semantic mediator‐based system using Spark for the integration of heterogeneous proteomics data sources

Chaimaa Messaoudi,Hassan Badir,Rachida Fissoune

doi:10.1002/cpe.5814

Abstract

SummaryWith the constant rise of data volumes in many disciplines, various new Big data management systems have emerged to provide scalable tools for efficient data integration, processing, and analysis. In this article, we provide an overview of biomedical data integration systems focusing on ontology‐based semantic systems and Big data technologies based systems such as Apache Spark. We also propose a new semantic data integration system, called Integrated Proteomics Data System (IPDS), which uses a mediator approach. IPDS provides users a unified interface for query processing and data exploration. This system takes advantage of the Apache Spark framework to perform the query transformation and execution needed to question the integrated data sources. We develop a domain ontology that allows the user to formulate its queries in terms defined in the ontology. IPDS is a case study of semantic proteomics data integration linking four data sources UniProt (protein annotation), String (protein‐protein interaction), PDB (protein structure), and Pubmed (biomedical citation).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

IPDS: A semantic mediator‐based system using Spark for the integration of heterogeneous proteomics data sources

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience

Lead the way for us

Journal: Concurrency and Computation: Practice and Experience	Publication Date: May 23, 2020
Citations: 5

Similar Papers

A Mediator Approach for a Semantic Integration of Heterogeneous Proteomics Data Sources
Chaimaa Messaoudi ... Hassan Badir
-
Chaimaa Messaoudi, et. al.Chaimaa Messaoudi ... Hassan Badir
01 Jan 2021
01 Jan 2021

OPSDS: A Semantic Data Integration and Service System Based on Domain Ontology
Xin Liu ... Chungjin Hu
-
Xin Liu, et. al.Xin Liu ... Chungjin Hu
01 Jun 2016
01 Jun 2016

Semantic big biodiversity data integration toolA
...
-
, et. al. ...
01 Jan 2018
01 Jan 2018

Ontology Opportunities and Challenges: Discussions from Semantic Data Integration Perspectives
Abrar Omar Alkhamisi ... Mostafa Saleh
-
Abrar Omar Alkhamisi, et. al.Abrar Omar Alkhamisi ... Mostafa Saleh
01 Mar 2020
01 Mar 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

IPDS: A semantic mediator‐based system using Spark for the integration of heterogeneous proteomics data sources

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience