Open Data Release and Privacy Concerns: Complexity in Mitigating Vulnerability with Controlled Perturbation

Shah Imran Alam, Syed Imtiyaz Hassan, M Afshar Alam, Farheen Siddiqui, Anil Kumar Mahto, Ihtiram Raza Khan, Rijwan Khan

doi:10.1155/2021/9929049

Open Access

https://doi.org/10.1155/2021/9929049

Abstract

The benefits of open data have been recognised worldwide over the past decades, and efforts to release more data under open licenses have intensified. The volume of open data in government repositories has risen steeply. In our study, we point out that privacy is one of the most consistent and prominent barriers to release. Strong privacy laws restrict data owners from opening data freely. In this paper, we study the applied solutions and, to the best of our knowledge, find that anonymity-preserving algorithms have done a substantial job of protecting privacy in the release of structured microdata. Such algorithms compete on the objective that the released anonymized data should not only preserve privacy but also retain the required level of quality. The K-anonymity algorithm is the foundation of many successor privacy-preserving algorithms. l-diversity claims to add another dimension of privacy protection. Used together, the two algorithms are known to provide a good balance between privacy and quality control of the dataset as a whole. In this research, we used the K-anonymity algorithm and compared the results with the addition of l-diversity. We discuss the gap and report the benefits and losses for various combinations of K and l values, assessed against the quality of the released data from an analyst’s perspective. We first use fictitious data to explain the general expectations and then contrast them with the findings on real data from the food technology domain. The work contradicts the general assumptions under a specific set of evaluation parameters for data quality assessment. Additionally, it argues for more research contributions in the field of anonymity preservation and for intensified effort on its major research trends, considering its importance and potential to benefit people.
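
To make the two privacy measures named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes the common definitions: a table is K-anonymous when every group of rows sharing the same quasi-identifier values has at least K rows, and (distinct) l-diverse when every such group contains at least l different sensitive values. The records, column names, and function names are hypothetical.

```python
from collections import defaultdict

def equivalence_classes(rows, quasi_identifiers):
    """Group rows that share the same quasi-identifier values."""
    groups = defaultdict(list)
    for row in rows:
        key = tuple(row[q] for q in quasi_identifiers)
        groups[key].append(row)
    return groups

def k_anonymity(rows, quasi_identifiers):
    """Return the size of the smallest equivalence class."""
    groups = equivalence_classes(rows, quasi_identifiers)
    return min(len(g) for g in groups.values())

def l_diversity(rows, quasi_identifiers, sensitive):
    """Return the smallest count of distinct sensitive values in any class."""
    groups = equivalence_classes(rows, quasi_identifiers)
    return min(len({r[sensitive] for r in g}) for g in groups.values())

# Fictitious, already-generalized records (hypothetical columns).
records = [
    {"age": "20-29", "zip": "110**", "condition": "flu"},
    {"age": "20-29", "zip": "110**", "condition": "flu"},
    {"age": "30-39", "zip": "110**", "condition": "asthma"},
    {"age": "30-39", "zip": "110**", "condition": "diabetes"},
    {"age": "30-39", "zip": "110**", "condition": "flu"},
]

k = k_anonymity(records, ["age", "zip"])
l = l_diversity(records, ["age", "zip"], "condition")
print(f"k = {k}, l = {l}")
```

In this toy table every class has at least two rows, yet the first class holds a single sensitive value, so an attacker who places a person in that class learns the condition. This is the kind of gap that l-diversity is meant to close on top of K-anonymity.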

Highlights

Open data have proved their importance in the field of research, open governance, development versus analysis, and business initiatives. The release of public open data has emerged as a critical need for the overall development of humanity as one nation
Numerous anonymity-based algorithms have been proposed to date to address the anonymity concerns of the data
As observed from the stacked plot of all anonymization strategies put together, the k-value is a more dominant factor than the l-value in reducing the quality of the anonymized data. In other words, generalization deteriorates data quality more than diversity does
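
The link between generalization and quality loss in the last highlight can be illustrated with a small sketch. It is an assumption-laden toy, not the paper's evaluation: ages are generalized into fixed-width intervals, the achieved k is the smallest interval population, and `precision_loss` is a hypothetical metric (interval width relative to the observed age range, capped at 1) standing in for the paper's own data quality parameters.

```python
from collections import Counter

def generalize_age(age, width):
    """Map an exact age to the interval [low, low + width - 1]."""
    low = (age // width) * width
    return (low, low + width - 1)

def smallest_class(ages, width):
    """Achieved k: population of the least-populated interval."""
    counts = Counter(generalize_age(a, width) for a in ages)
    return min(counts.values())

def precision_loss(ages, width):
    """Hypothetical loss: share of the observed range hidden per interval."""
    span = max(ages) - min(ages)
    return min(1.0, (width - 1) / span)

# Fictitious ages (hypothetical data).
ages = [21, 23, 24, 27, 31, 33, 36, 38]

for width in (1, 5, 10, 20):
    achieved_k = smallest_class(ages, width)
    loss = precision_loss(ages, width)
    print(f"width={width:2d}  k={achieved_k}  loss={loss:.2f}")
```

Under these assumptions, reaching a larger k requires wider intervals, and each widening hides more of the attribute's precision from the analyst.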

Summary

Introduction

Open data have proved their importance in the field of research, open governance, development versus analysis, and business initiatives. The release of public open data has emerged as a critical need for the overall development of humanity as one nation. Researchers worldwide used open COVID-19 data to help governments and organizations like WHO enforce measures and suggest policies. The threat to the individuals to whom the data refer is felt intensely because of the fear of identification or reidentification. This has long been a growing concern and has been criticized throughout the world, not just for COVID-19 data but for all released data where an identity disclosure attack is possible. Researchers have been trying to find a balance between the quality of the open data released and the possibility of identity revelation through attacks.


Journal: Journal of Food Quality	Publication Date: Jun 21, 2021
License type: CC BY 4.0

Similar Papers

A K-Anonymous Full Domain Generalization Algorithm Based on Heap Sort
Xuyang Zhou ... Meikang Qiu
01 Jan 2018

Changing civil servants’ behaviour concerning the opening of governmental data: evaluating the effect of a game by comparing civil servants’ intentions before and after a game intervention
Fernando Kleiman ... Sebastiaan Meijer
International Review of Administrative Sciences | VOL. 88
30 Sep 2020

Open research data - Expectations and limitations.
Karin M Kirschner
Acta Physiologica Scandinavica | VOL. 236
31 Oct 2022

Public Servants’ Perception Towards Publishing Quality and Impactful Open Data to Support Open Science Initiatives in Malaysia
Aini Suzana Ariffin ... Mohd Azreey Shah Bin Abd Aziz
Journal of Science, Technology and Innovation Policy | VOL. 8
06 Oct 2022
