Triple Store Research Articles

The Resource Description Framework (RDF) is widely used to model web data. The scale and complexity of the modeled data emphasized performance challenges on the RDF-triple stores. Workload adaption is one important strategy to deal with those challenges on the storage level. Current workload-adaption approaches lack the necessary generalization of the problem and only optimize part of the storage layer with the workload (mostly the replication). This creates a big performance gap within other data structures (e.g. indexes and cache) that could heavily benefit from the same workload adaption strategy. Moreover, the workload statistics are built collectively in most of the current approaches. Thus, the analysis process is unaware of whether workloads’ items are old or recent. However, that does not simulate the temporal trends that exist naturally in user queries which causes the analysis process to lag behind the rapid workload development. We present a novel universal adaption approach to the storage management of a distributed RDF store. The system aims to find optimal data assignments to the different indexes, replications, and join cache within the limited storage space. We present a cost model based on the workload that often contains frequent patterns. The workload is dynamically and continuously analyzed to evaluate predefined rules considering the benefits and costs of all options of assigning data to the storage structures. The objective is to reduce query execution time by letting different data containers compete on the limited storage space. By modeling the workload statistics as time series, we can apply well-known smoothing techniques allowing the importance of the workload to decay over time. That allows the universal adaption to stay tuned with potential changes in the workload trends.

Read full abstract

As the volume and ubiquity of graphs increase, a compact graph representation becomes essential for enabling efficient storage, transfer, and processing of graphs. Given a graph, the graph summarization problem asks for a compact representation that consists of a summary graph and the corrections, such that we can recreate the original graph from the representation exactly. Although this problem has been studied extensively, the existing works either trade summary compactness for efficiency, or vice versa. In particular, a well-known greedy method provides the most compact summary but incurs prohibitive time cost, while the state-of-the-art algorithms with practical overheads are more than 20% behind in summary compactness in our comparison with the greedy method. This paper presents Mags and Mags-DM, two algorithms that aim to bridge the compactness and efficiency in graph summarization. Mags adopts the existing greedy paradigm that provides state-of-the-art compactness, but significantly improves its efficiency with a novel algorithm design. Meanwhile, Mags-DM follows a different paradigm with practical efficiency and overcomes its limitations in compactness. Moreover, both algorithms can support parallel computing environments. We evaluate Mags and Mags-DM on graphs up to billion-scale and demonstrate that they achieve state-of-the-art in both compactness and efficiency, rather than in one of them. Compared with the method that offers state-of-the-art compactness, Mags and Mags-DM have a small difference (< 0.1% and < 2.1%) in compactness. For efficiency, Mags is on average 11.1x and 4.2x faster than the two state-of-the-art algorithms with practical overheads, while Mags-DM can further reduce the running time by 13.4x compared with Mags. This shows that graph summarization algorithms can be made practical while still offering a compact summary.

Read full abstract

Triple Store Research Articles

Related Topics

Articles published on Triple Store

Storage and Query of Drug Knowledge Graphs Using Distributed Graph Databases: A Case Study

Scalable and High-Performance Large-Scale Dynamic Graph Storage and Processing System

High-Performance Graph Storage and Mutation for Graph Processing and Streaming

A machine learning potential construction based on radial distribution function sampling.

AutoGMap: Learning to Map Large-Scale Sparse Graphs on Memristive Crossbars.

A review of graph neural networks and pretrained language models for knowledge graph reasoning

Analyzing workload trends for boosting triple stores performance

Poligras: Policy-Based Graph Summarization

Graph Summarization: Compactness Meets Efficiency

GENTI: GPU-Powered Walk-Based Subgraph Extraction for Scalable Representation Learning on Dynamic Graphs

Declarative generation of RDF-star graphs from heterogeneous data

Spruce: a Fast yet Space-saving Structure for Dynamic Graph Storage

Building a High-Performance Graph Storage on Top of Tree-Structured Key-Value Stores

LUBM4OBDA: Benchmarking OBDA Systems with Inference and Meta Knowledge

A simple and efficient approach to unsupervised instance matching and its application to linked data of power plants

Knowledge graph‐driven data processing for business intelligence

Towards fair and personalized federated recommendation

Multi-hop path reasoning over sparse temporal knowledge graphs based on path completion and reward shaping

NeutronStream: A Dynamic GNN Training Framework with Sliding Window for Graph Streams

The Design of Triple Store and Query Processing on GPU for Large Scale Resource Description Framework Data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Triple Store Research Articles

Related Topics

Articles published on Triple Store

Storage and Query of Drug Knowledge Graphs Using Distributed Graph Databases: A Case Study

Scalable and High-Performance Large-Scale Dynamic Graph Storage and Processing System

High-Performance Graph Storage and Mutation for Graph Processing and Streaming

A machine learning potential construction based on radial distribution function sampling.

AutoGMap: Learning to Map Large-Scale Sparse Graphs on Memristive Crossbars.

A review of graph neural networks and pretrained language models for knowledge graph reasoning

Analyzing workload trends for boosting triple stores performance

Poligras: Policy-Based Graph Summarization

Graph Summarization: Compactness Meets Efficiency

GENTI: GPU-Powered Walk-Based Subgraph Extraction for Scalable Representation Learning on Dynamic Graphs

Declarative generation of RDF-star graphs from heterogeneous data

Spruce: a Fast yet Space-saving Structure for Dynamic Graph Storage

Building a High-Performance Graph Storage on Top of Tree-Structured Key-Value Stores

LUBM4OBDA: Benchmarking OBDA Systems with Inference and Meta Knowledge

A simple and efficient approach to unsupervised instance matching and its application to linked data of power plants

Knowledge graph‐driven data processing for business intelligence

Towards fair and personalized federated recommendation

Multi-hop path reasoning over sparse temporal knowledge graphs based on path completion and reward shaping

NeutronStream: A Dynamic GNN Training Framework with Sliding Window for Graph Streams

The Design of Triple Store and Query Processing on GPU for Large Scale Resource Description Framework Data