Abstract

In scientific domains such as high-energy particle physics and genomics, the quantity of high-speed data traffic generated may far exceed the storage throughput and be unable to be in time stored in the current node. Cooperating and utilizing multiple storage nodes on the forwarding path provides an opportunity for high-speed data storage. This paper proposes the use of flow entries to dynamically split traffic among selected neighbor nodes to sequentially amortize excess traffic. We propose a neighbor selection mechanism based on the Local Name Mapping and Resolution System, in which the node weights are computed by combing the link bandwidth and node storage capability, and determining whether to split traffic by comparing normalized weight values with thresholds. To dynamically offload traffic among multiple targets, the cooperative storage strategy implemented in a programmable data plane is presented using the relative weights and ID suffix matching. Evaluation shows that our proposed schema is more efficient compared with end-to-end transmission and ECMP in terms of bandwidth usage and transfer time, and is beneficial in big science.

Highlights

  • Carlos Lopez-Ardao, In many scientific disciplines, a huge amount of data is being generated

  • Name Mapping and Resolution System, in which the node weights are computed by combining the link bandwidth and node storage capability, and determining whether to split traffic by comparing normalized weight values with thresholds

  • We mainly focus on the throughput and transfer time influenced by the load balancing schema in the ID-based data chunk transmission

Read more

Summary

Introduction

Carlos Lopez-Ardao, In many scientific disciplines, a huge amount of data is being generated. The work of [6] presented a preliminary study addressing opportunities and challenges to enable NDN-based intelligent data retrieval in networks for high-energy physics (HEP). These studies focused on the management of a huge quantity of scientific data and data retrieval, and rarely involved the persistent storage of collected data in networks. To address high-speed data storage, we proposed an in-network storage service node structure to support cooperative storage in neighbor nodes, and designed an identifier (ID)-based cooperative storage protocol.

Background
Distributed Storage
Storage at the Edge
Current Scientific Data Management Systems
NDN Based Big Science
System Architecture
An ID-Based Cooperative Storage Protocol
In-Network Storage Service Node Structure
Neighbor Selection Mechanism
Cooperative Storage Strategy
Network Environment Setup
Evaluation of the Selection Mechanism
Comparison with End-to-End Transmission
Experiments
Influence of Cooperative Storage Schema
The Accuracy of Traffic Split Ratio
Comparison with Common File Systems
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.