Target Schema Research Articles

Data integration combines information from multiple sources. The problem is how to find and optimally combine data from scattered, heterogeneous sources that are semantically interconnected. This heterogeneity results from several factors, including databases stored in different formats, different software and hardware used for database storage systems, and designs based on different semantic data models (Katsis & Papakonstantinou, 2009; Ziegler & Dittrich, 2004). There are currently two approaches to data integration, Global as View (GAV) and Local as View (LAV); each has its own advantages and limitations, so careful analysis is needed before applying either. One major factor in integrating heterogeneous data sources efficiently and effectively is an understanding of the type and structure of the source data (the source schema). Another is the type of view that the integration produces (the target schema): the results can be presented as a single global view or as a variety of other views. Integrating structured sources therefore calls for a different approach than integrating unstructured or semi-structured sources. A schema mapping is a declarative specification that describes the relationship between the source schema and the target schema. It is expressed as logical formulas that support data interoperability, data exchange, and data integration in applications. This paper considers the establishment of a data center for a patient referral service, which requires integrating data from a number of different health facilities and therefore requires the design of a schema mapping system (to support optimization).
The data center serves as the target schema, and the various referral service units serve as source schemas whose data are structured and independent. The sources can therefore be integrated in a structured way into an integrated view (the data center) using equivalent query rewriting. As the global schema and integration target, the data center requires a "mediator" that acts as a guide, maintaining the global schema and the mappings between the global and local schemas. Under Global as View (GAV), the data center tends to be a single, unified view, so integrating it effectively with the various source schemas requires an integration facility. This facility, called "Pemadu" (integrator), is a declarative mapping language that explicitly links each source schema to the data center. Equivalent query rewriting is therefore suitable for query optimization and for maintaining physical data independence.
Keywords: Global as View (GAV), Local as View (LAV), source schema, mapping schema
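To make the GAV idea in this abstract concrete, here is a minimal Python sketch, not the paper's implementation. The two facilities, their field names, and the global relation `referral(patient_id, facility, diagnosis)` are all hypothetical. It shows a global relation defined as a view over the sources, and a query on the global schema answered by unfolding that view.

```python
# Hypothetical source data: two health facilities with different local schemas.
clinic_a = [
    {"pid": "P1", "diag": "J45"},
    {"pid": "P2", "diag": "E11"},
]
hospital_b = [
    {"patient_no": "P3", "icd_code": "I10"},
]


def referral_view():
    """GAV mapping (assumed): the global relation
    referral(patient_id, facility, diagnosis) is defined as a query
    over the source schemas."""
    for row in clinic_a:
        yield {"patient_id": row["pid"], "facility": "clinic_a",
               "diagnosis": row["diag"]}
    for row in hospital_b:
        yield {"patient_id": row["patient_no"], "facility": "hospital_b",
               "diagnosis": row["icd_code"]}


def query_referrals(diagnosis):
    """A query over the global schema, answered by unfolding the view."""
    return [r["patient_id"] for r in referral_view()
            if r["diagnosis"] == diagnosis]


if __name__ == "__main__":
    print(query_referrals("I10"))
```

In this sketch the mediator role is played by `referral_view`: adding a new facility means adding one more branch to the view, while queries written against the global schema stay unchanged.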


This article is part of the Focus Theme of METHODS of Information in Medicine on "Managing Interoperability and Complexity in Health Systems". The need for complementary access to multiple RDF databases has fostered new lines of research, but also entailed new challenges due to data representation disparities. While several approaches for RDF-based database integration have been proposed, those focused on schema alignment have become the most widely adopted. All state-of-the-art solutions for aligning RDF-based sources resort to a simple technique inherited from legacy relational database integration methods. This technique, known as element-to-element (e2e) mappings, is based on establishing 1:1 mappings between single primitive elements (e.g. concepts, attributes, relationships) belonging to the source and target schemas. However, due to the intrinsic nature of RDF, a representation language based on defining tuples <subject, predicate, object>, one may find RDF elements whose semantics vary dramatically when combined into a view involving other RDF elements, i.e. they depend on their context. Such elements cannot be adequately represented in the target schema by resorting to the traditional e2e approach. Existing approaches fail to address this issue without explicitly modifying the target ontology, and thus lack the expressiveness required to reflect the intended semantics in the alignment information. Our objective is to enhance existing RDF schema alignment techniques by providing a mechanism to properly represent elements with context-dependent semantics, thus enabling users to perform more expressive alignments, including scenarios that cannot be adequately addressed by the existing approaches. Instead of establishing 1:1 correspondences between single primitive elements of the schemas, we propose adopting a view-based approach.
The latter is targeted at establishing mapping relationships between RDF subgraphs - that can be regarded as the equivalent of views in traditional databases -, rather than between single schema elements. This approach enables users to represent scenarios defined by context-dependent RDF elements that cannot be properly represented when adopting the currently existing approaches. We developed a software tool implementing our view-based strategy. Our tool is currently being used in the context of the European Commission funded p-medicine project, targeted at creating a technological framework to integrate clinical and genomic data to facilitate the development of personalized drugs and therapies for cancer, based on the genetic profile of the patient. We used our tool to integrate different RDF-based databases - including different repositories of clinical trials and DICOM images - using the Health Data Ontology Trunk (HDOT) ontology as the target schema. The importance of database integration methods and tools in the context of biomedical research has been widely recognized. Modern research in this area - e.g. identification of disease biomarkers, or design of personalized therapies - heavily relies on the availability of a technical framework to enable researchers to uniformly access disparate repositories. We present a method and a tool that implement a novel alignment method specifically designed to support and enhance the integration of RDF-based data sources at schema (metadata) level. This approach provides an increased level of expressiveness compared to other existing solutions, and allows solving heterogeneity scenarios that cannot be properly represented using other state-of-the-art techniques.
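The contrast between e2e and view-based mappings can be sketched in a few lines of Python. This is an illustrative toy, not the authors' tool: the triples, the predicate names (`hasValue`, `tumorSizeMm`, `ageYears`), and the tiny pattern matcher are all assumptions made for the example.

```python
# Hypothetical source triples: the meaning of "hasValue" depends on the
# type of the observation it is attached to (its context).
source = {
    ("obs1", "type", "TumorSize"),
    ("obs1", "hasValue", "23"),
    ("obs2", "type", "Age"),
    ("obs2", "hasValue", "61"),
}

# e2e mapping: one source element maps to exactly one target element,
# so both "hasValue" triples land under the same target predicate.
E2E = {"hasValue": "value"}


def apply_e2e(triples):
    return {(s, E2E[p], o) for s, p, o in triples if p in E2E}


# View-based mapping: a source subgraph pattern maps to a target pattern.
# Terms starting with "?" are variables shared across the pattern.
VIEW_MAPPINGS = [
    ([("?x", "type", "TumorSize"), ("?x", "hasValue", "?v")],
     [("?x", "tumorSizeMm", "?v")]),
    ([("?x", "type", "Age"), ("?x", "hasValue", "?v")],
     [("?x", "ageYears", "?v")]),
]


def match(pattern, triples, binding=None):
    """Yield each variable binding under which every pattern triple
    occurs in the data."""
    binding = binding or {}
    if not pattern:
        yield binding
        return
    head, rest = pattern[0], pattern[1:]
    for triple in triples:
        new = dict(binding)
        ok = True
        for term, value in zip(head, triple):
            if term.startswith("?"):
                # Bind the variable, or check it against an earlier binding.
                if new.setdefault(term, value) != value:
                    ok = False
                    break
            elif term != value:
                ok = False
                break
        if ok:
            yield from match(rest, triples, new)


def apply_views(triples):
    out = set()
    for src_pattern, tgt_pattern in VIEW_MAPPINGS:
        for b in match(src_pattern, triples):
            for t in tgt_pattern:
                out.add(tuple(b.get(term, term) for term in t))
    return out
```

Here `apply_e2e` cannot tell the two `hasValue` triples apart, whereas `apply_views` uses the surrounding `type` triple as context and emits a distinct target predicate for each case.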


Articles published on Target Schema

Querying Data Exchange Settings Beyond Positive Queries

Integration of Building Information Modeling Interoperability into Nonlinear Finite Element Analysis of Bridge Substructures

Schema mapping generation in the wild

Active Learning for Knowledge Graph Schema Expansion

Glean

Interoperable Visualization Framework Towards Enhancing Mapping and Integration of Official Statistics

SMAT: An attention-based deep learning solution to the automation of schema matching.

A Study on Information-Preserving Schema Transformations

Temporal data exchange

Incorporating Data Context to Cost-Effectively Automate End-to-End Data Wrangling

CDI: Configurable Data Integration Using Property Precedence Relations

Approximate top-K answering under uncertain schema mappings

DATA INTEGRATION MODEL DESIGN FOR SUPPORTING DATA CENTER PATIENT SERVICES DISTRIBUTED INSURANCE PURCHASE WITH VIEW BASED DATA INTEGRATION

SEDEX: Scalable Entity Preserving Data Exchange

A Probabilistic Approach to Knowledge Translation

Fuzzy data exchange

Translating Relational Database Schemas into Object-based Schemas: University Case Study

Ontology-based mappings

Toward a view-oriented approach for aligning RDF-based biomedical repositories.

Optimizing the chase
