Abstract

A data lake is a relatively recent technology to maintain and allow access to voluminous and heterogeneous data sources. Governments, large corporations and startups have increasingly considered it for storing useful data and obtain valuable business trends. However, there is still a long evolutionary path related to data lake management, where data security is an open issue. In this paper we investigate confidentiality issues in the context of data lakes, with a focus on authentication and authorization. We apply a systematic review methodology focusing on approaches that provide some technology for authentication and authorization management. In the following, we compare the selected studies w.r.t. the used technologies and we also analyze how they are positioned w.r.t. a reference architecture for a data lake management system. This is the first paper that presents such a kind of analysis for data lakes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call