DCache – Efficient Message Encoding For Inter-Service Communication in dCache: Evaluation of Existing Serialization Protocols as a Replacement for Java Object Serialization

Lea Morschel,Paul Millar,Juergen Starek,Marina Sahakyan,Sibel Yasar,Albert Rossi,Dmitry Litvintsev,Vincent Garonne,Olufemi Adeyemi,Tigran Mkrtchyan

doi:10.1051/epjconf/202024505017

Abstract

As a well established, large-scale distributed storage system, dCache is used to manage and serve huge amounts of data collected by high energy physics, astrophysics and photon science experiments. Based on a microservices-like architecture, dCache is built as a modular distributed system, where each component provides a different core functionality. These services communicate by passing serialized messages to each other, a core behavior whose performance properties can consequently affect the entire system. This paper compares and evaluates different data serialization protocols in computer science with the objective of replacing and improving upon Java Object Serialization (JOS), which has increasingly presented itself as no longer being sufficiently performant for encoding messages. The criteria for choosing a new framework are collected, analyzed and formalized. The primary motivation for replacing Java serialization for encoding dCache messages is increasing the general speed of message-passing and thereby reducing the round-trip time for user requests. Emphasis is also placed on schema evolution capabilities and framework usability. Approaches for generalizing (de)serialization speed and size measurements based on data structure complexity are introduced, criteria for measuring documentation, learning curve, maintainability and introduction effort are defined. Finally, several selected serialization protocols are evaluated and compared accordingly, concluding with a recommendation for a suitable JOS replacement.

Highlights

The dCache software [1] is an open-source distributed storage system written in Java, which uses a microservices-like architecture to provide location-independent access to data
The primary motivation for replacing Java serialization is increasing the general speed of message-passing and thereby reducing the round-trip time for user requests
Within dCache, the Java object serialization is used to serialize these messages to a binary format

Summary

Introduction

The dCache software [1] is an open-source distributed storage system written in Java, which uses a microservices-like architecture to provide location-independent access to data. It is designed to support a wide range of use cases, from high-throughput data ingest, being dynamically scalable to hundreds of petabytes, as well as deployable in heterogeneous systems and on commodity hardware. It is easy to integrate with other systems, because it can communicate over several protocols for accessing data and enabling authentication, and supports. A significant portion of the time needed for internal communication between services is spent on serializing and deserializing messages. The primary motivation for replacing Java serialization is increasing the general speed of message-passing and thereby reducing the round-trip time for user requests. This paper is the summary of a larger scientific thesis [3]

Related Work

Current Message Serialization in dCache

Criteria for a New Serialization Protocol in dCache

Serialization Protocols to be Evaluated

Evaluation Scenarios

Environment and Tools

Results

Summary and Outlook

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EPJ Web of Conferences	Publication Date: Jan 1, 2020
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

DCache – Efficient Message Encoding For Inter-Service Communication in dCache: Evaluation of Existing Serialization Protocols as a Replacement for Java Object Serialization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EPJ Web of Conferences

Lead the way for us

Similar Papers

The Bulk service and WLCG TAPE API support in dCache
Albert Rossi ... Svenja Meyer
-
Albert Rossi, et. al.Albert Rossi ... Svenja Meyer
27 Apr 2023
27 Apr 2023

High-energy astrophysics and cosmology
John Ellis
Nuclear Physics B - Proceedings Supplements | VOL. 122
John EllisJohn Ellis
01 Jul 2003
Nuclear Physics B - Proceedings Supplements | VOL. 122

COMPUTATIONAL SCIENCE CENTER
J Davenport
-
J DavenportJ Davenport
01 Nov 2005
01 Nov 2005

SciDAC 2008
Michael Strayer
Journal of Physics: Conference Series | VOL. 125
Michael StrayerMichael Strayer
01 Jul 2008
SciDAC 2008
Michael Strayer

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DCache – Efficient Message Encoding For Inter-Service Communication in dCache: Evaluation of Existing Serialization Protocols as a Replacement for Java Object Serialization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EPJ Web of Conferences