Abstract

The digital transformation of companies moves many technological and business processes to the cloud. This creates new risks for confidential information, which is stored in the cloud infrastructure in different data formats, and adds to the complexity and heterogeneity of that infrastructure. Cloud infocommunication systems built on open-source software components exhibit variability in data formats and structures. Such formats and structures may be convenient and reasonable when developing cloud components, but working with them during operation and maintenance is complicated by mismatches between the varieties and by the lack of dedicated tools for querying across multiple data formats. This article considers the YAML, JSON and XML data formats and takes as its first hypothesis that they can be transformed into a single format. Based on an analysis of original data samples taken from OpenStack components, three typical data structures are proposed. The second hypothesis is that data taken from multiple sources can be joined. To verify these assumptions, algorithms for transforming the data into a tabular format are developed. The speed of the transformations is verified on a large sample, and the results obtained are acceptable for use in operating and maintaining cloud infocommunication systems. After the proposed algorithms are applied to the original data samples, an SQL query and a way to query multiple data sources are proposed. The results of the research may be applicable to cloud infrastructure availability solutions that work with multiple data formats.
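As a rough illustration of the general idea (not the article's actual algorithms, which are not reproduced in this abstract), the sketch below flattens nested documents into rows of a single table and then runs one SQL query across several sources. The table layout (`source`, `path`, `value`), the helper names `flatten`, `load_table`, `build_demo` and `ports`, and the sample documents are all hypothetical. JSON is used for the samples only because it can be parsed with the Python standard library; YAML or XML input would need its own parser in front of the same flattening step.

```python
import json
import sqlite3


def flatten(node, prefix=""):
    """Yield (path, value) pairs for every scalar leaf of a nested dict/list."""
    if isinstance(node, dict):
        for key, value in node.items():
            child = f"{prefix}.{key}" if prefix else str(key)
            yield from flatten(value, child)
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from flatten(value, f"{prefix}[{index}]")
    else:
        yield prefix, node


def load_table(conn, source, document):
    """Store one parsed document as (source, path, value) rows."""
    rows = [
        (source, path, None if value is None else str(value))
        for path, value in flatten(document)
    ]
    conn.executemany(
        "INSERT INTO config (source, path, value) VALUES (?, ?, ?)", rows
    )


def build_demo():
    """Build an in-memory table from two made-up sample documents."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE config (source TEXT, path TEXT, value TEXT)")
    # Hypothetical sample data, not taken from the article.
    nova = json.loads('{"service": {"name": "nova", "port": 8774}}')
    glance = json.loads('{"service": {"name": "glance", "port": 9292}}')
    load_table(conn, "nova.json", nova)
    load_table(conn, "glance.json", glance)
    return conn


def ports(conn):
    """Query the same path across every loaded source."""
    return conn.execute(
        "SELECT source, value FROM config "
        "WHERE path = 'service.port' ORDER BY source"
    ).fetchall()
```

Storing every leaf as a (source, path, value) row is only one possible tabular layout; it is chosen here because a single `WHERE path = ...` clause can then address the same setting in every source file.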
