Multi-Party Privacy-Preserving Logistic Regression with Poor Quality Data Filtering for IoT Contributors

Kennedy Edemacu,Jong Wook Kim

doi:10.3390/electronics10172049

Abstract

Nowadays, the internet of things (IoT) is used to generate data in several application domains. A logistic regression, which is a standard machine learning algorithm with a wide application range, is built on such data. Nevertheless, building a powerful and effective logistic regression model requires large amounts of data. Thus, collaboration between multiple IoT participants has often been the go-to approach. However, privacy concerns and poor data quality are two challenges that threaten the success of such a setting. Several studies have proposed different methods to address the privacy concern but to the best of our knowledge, little attention has been paid towards addressing the poor data quality problems in the multi-party logistic regression model. Thus, in this study, we propose a multi-party privacy-preserving logistic regression framework with poor quality data filtering for IoT data contributors to address both problems. Specifically, we propose a new metric gradient similarity in a distributed setting that we employ to filter out parameters from data contributors with poor quality data. To solve the privacy challenge, we employ homomorphic encryption. Theoretical analysis and experimental evaluations using real-world datasets demonstrate that our proposed framework is privacy-preserving and robust against poor quality data.

Highlights

The combined usage of machine learning techniques with the internet of things (IoT) is expected to improve service delivery in several application domains such as industries, smart mobility, cyber-physical systems, smart cities, smart health, etc. [1]
We aim to provide a solution to the problem of data quality and privacy during multi-party logistic regression model training
We propose a novel metric gradient similarity (Gsim) in a distributed setting used to determine the quality of the data contributed by the IoT participants; We combine Gsim with homomorphic encryption (HE) to design a multi-party privacy-preserving logistic regression model that filters out poor quality data during the model training; We perform analysis and conduct experiments with real-world datasets to demonstrate the effectiveness of our designed framework

Summary

Introduction

The combined usage of machine learning techniques (e.g., logistic regression) with the internet of things (IoT) is expected to improve service delivery in several application domains such as industries, smart mobility, cyber-physical systems, smart cities, smart health, etc. [1]. The combined usage of machine learning techniques (e.g., logistic regression) with the internet of things (IoT) is expected to improve service delivery in several application domains such as industries, smart mobility, cyber-physical systems, smart cities, smart health, etc. The success of these machine learning techniques, and in particular logistic regression, depends on the availability of massive training data. IoT parties contribute their data for the model training. In conventional multi-party logistic regressions, a server is required to store, process, and share data from geographically distributed IoT data contributors. Privacy preservation is a major challenge in a multi-party setting

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: Aug 25, 2021
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Multi-Party Privacy-Preserving Logistic Regression with Poor Quality Data Filtering for IoT Contributors

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

The costs of poor data quality
Anders Haug ... Frederik Zachariassen
Journal of Industrial Engineering and Management | VOL. 4
Anders Haug, et. al.Anders Haug ... Frederik Zachariassen
14 Jul 2011
Journal of Industrial Engineering and Management | VOL. 4

The costs of poor data quality
...
Journal of Industrial Engineering and Management | VOL. 4
, et. al. ...
21 Jul 2011
Journal of Industrial Engineering and Management | VOL. 4

Fitting straight lines to poor quality ( χ, y) data
R.K Pearson
Mathematical and Computer Modelling | VOL. 16
R.K PearsonR.K Pearson
01 Mar 1992
Mathematical and Computer Modelling | VOL. 16

Application of 3D Sampling Trajectory in EVDRS Algorithm
Zhongyuan Mou ... Jie Yang
-
Zhongyuan Mou, et. al.Zhongyuan Mou ... Jie Yang
01 Dec 2013
01 Dec 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Party Privacy-Preserving Logistic Regression with Poor Quality Data Filtering for IoT Contributors

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics