Abstract
Graph storage technology is confronted with an enormous challenge as far as the compact and complex graph-structure data. This phenomenon is derived from social networks with spatially intensive data. Since a hot event can cause the generation of a network cluster, which consists of a massive duplicate associated entities in the social networks, the space utilization and processing speed of graph data is obstructed. Therefore, it is necessary to design a graph storage mechanism specifically for the above data. In this paper, we propose a G raph compression S torage engine based on spatial C luster entity O ptimization ( GSCO ), which improves the native graph storage model through the proposed the many-to-one mapping structure and a H eat E volution E limination algorithm ( H2E ). Firstly, we define the spatial cluster entity formally and confirm the compressed storage objects. Then, we introduce the many-to-one relationship to transfer the mapping structure between the node and property. It compresses the data to raise the space utilization of the graph database. Finally, we propose the H2E algorithm that allows the representative nodes to be anchored an extended period in memory according to the heat evolution acceleration. It increases the hit rate and throughput and reduces the I/O operation by deleting the redundancy of data. Extensive experiments results show that the proposed GSCO storage model is better than Neo4j for reading and writing data in spatial clustering entity. It significantly promotes the effectiveness of graph operation, including the data loading, the common queries, and the clustering test.
Highlights
Graph database has been extensively studied and applied with the emergence of abundant relational data such as knowledge mapping and social networks
We evaluate the proposed GSCO model on a real-public dataset, which is collected from Sina Weibo and consists of 13 hot events
In this paper, we propose the new graph storage engine GSCO, considering that the existing storage model did not scale well to spatial cluster entity in the social networks
Summary
Graph database has been extensively studied and applied with the emergence of abundant relational data such as knowledge mapping and social networks. The goal of this paper is to design a new storage model in the graph database and compress the duplicate data presenting space-intensive in the social network. The abundance network structure of the space-intensive clusters increases the workload of the graph database, which can occupy the main system resources when traversing its nodes. The extensive experimental results show that our GSCO model can achieve high space utilization in compressed storage and effectively improves the cache hit rate in the data read and query. These outperform, GSCO is significantly superior to the state-of-the-art Neo4j graph model adopting the labeled property graphs.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.