Privacy-preserving Linear Regression Scheme and Its Application

Linear regression is an important and widely used machine learning algorithm. Training a linear regression model usually depends on a large amount of data. In practice, the data set is generally held by different users and contains their private information. When multiple users want to gather more d...

Bibliographic Details
Main Author: LYU You, WU Wen-yuan
Format: Article
Language: zho
Published: Editorial office of Computer Science 2022-09-01
Series: Jisuanji kexue
Subjects: privacy-preserving; linear regression; hybrid iterative method; homomorphic encryption
Online Access: https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-9-318.pdf
_version_ 1797845128394571776
author LYU You, WU Wen-yuan
author_facet LYU You, WU Wen-yuan
author_sort LYU You, WU Wen-yuan
collection DOAJ
description Linear regression is an important and widely used machine learning algorithm. Training a linear regression model usually depends on a large amount of data. In practice, the data set is generally held by different users and contains their private information. When multiple users want to pool their data to train a better model, the users' privacy is inevitably involved. As a privacy protection technology, homomorphic encryption can effectively prevent privacy leakage during computation. A new privacy-preserving linear regression scheme based on a hybrid iterative method is designed for the scenario where the data set is horizontally partitioned between two users. The scheme is divided into two stages. The first stage implements the stochastic gradient descent algorithm in the ciphertext domain. In the second stage, a secure two-party fast descent protocol is designed. The core idea of the protocol is based on the Jacobi iterative method, which compensates for the poor convergence of gradient descent in practical applications, accelerates the convergence of the model, and protects the data privacy of both users while effectively training the linear regression model. The efficiency, communication overhead and security of the scheme are analyzed. The scheme is implemented in C++ and applied to real data sets. Extensive experiments show that the scheme can effectively solve linear regression problems with a large number of features. The relative error of the coefficient of determination is less than 0.001, which shows that the privacy-preserving linear regression model performs on real data sets almost as well as a model trained directly on unencrypted data, and that the scheme can meet practical application requirements in specific scenarios.
first_indexed 2024-04-09T17:33:34Z
format Article
id doaj.art-60705028bb494696a3b53075a18630e8
institution Directory Open Access Journal
issn 1002-137X
language zho
last_indexed 2024-04-09T17:33:34Z
publishDate 2022-09-01
publisher Editorial office of Computer Science
record_format Article
series Jisuanji kexue
spelling doaj.art-60705028bb494696a3b53075a18630e8; 2023-04-18T02:32:31Z; zho; Editorial office of Computer Science; Jisuanji kexue; 1002-137X; 2022-09-01; vol. 49, no. 9, pp. 318-325; 10.11896/jsjkx.220300190; Privacy-preserving Linear Regression Scheme and Its Application; LYU You, WU Wen-yuan; 1 Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China; 2 University of Chinese Academy of Sciences, Beijing 100049, China; https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-9-318.pdf; privacy-preserving|linear regression|hybrid iterative method|homomorphic encryption
spellingShingle LYU You, WU Wen-yuan
Privacy-preserving Linear Regression Scheme and Its Application
Jisuanji kexue
privacy-preserving|linear regression|hybrid iterative method|homomorphic encryption
title Privacy-preserving Linear Regression Scheme and Its Application
title_full Privacy-preserving Linear Regression Scheme and Its Application
title_fullStr Privacy-preserving Linear Regression Scheme and Its Application
title_full_unstemmed Privacy-preserving Linear Regression Scheme and Its Application
title_short Privacy-preserving Linear Regression Scheme and Its Application
title_sort privacy preserving linear regression scheme and its application
topic privacy-preserving|linear regression|hybrid iterative method|homomorphic encryption
url https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-9-318.pdf
work_keys_str_mv AT lyuyouwuwenyuan privacypreservinglinearregressionschemeanditsapplication
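
The two-stage idea summarized in the description (a gradient descent stage followed by a Jacobi-style stage, on data split horizontally between two users) can be illustrated with a small plaintext sketch. This is not the authors' protocol: it uses no homomorphic encryption, it substitutes plain full-batch gradient descent for the first stage, and every name in it (local_stats, gradient_step, jacobi_step, hybrid_solve, the toy data) is hypothetical. It also assumes the summed Gram matrix is strictly diagonally dominant, which is a sufficient condition for Jacobi iteration to converge; the record does not say how the paper handles convergence.

```python
# Plaintext illustration only: no encryption, hypothetical names and toy data.

def local_stats(rows, targets):
    """Each user computes X^T X and X^T y over its own rows only."""
    d = len(rows[0])
    gram = [[0.0] * d for _ in range(d)]
    moment = [0.0] * d
    for x, y in zip(rows, targets):
        for j in range(d):
            moment[j] += x[j] * y
            for k in range(d):
                gram[j][k] += x[j] * x[k]
    return gram, moment

def gradient_step(A, b, w, lr):
    """One full-batch gradient descent step on 0.5 * ||Xw - y||^2 (gradient = Aw - b)."""
    d = len(w)
    grad = [sum(A[j][k] * w[k] for k in range(d)) - b[j] for j in range(d)]
    return [w[j] - lr * grad[j] for j in range(d)]

def jacobi_step(A, b, w):
    """One Jacobi sweep on the normal equations A w = b."""
    d = len(w)
    return [
        (b[j] - sum(A[j][k] * w[k] for k in range(d) if k != j)) / A[j][j]
        for j in range(d)
    ]

def hybrid_solve(A, b, gd_steps=5, jacobi_steps=30, lr=0.05):
    """Stage 1: a few gradient steps. Stage 2: Jacobi sweeps from that starting point."""
    w = [0.0] * len(b)
    for _ in range(gd_steps):
        w = gradient_step(A, b, w, lr)
    for _ in range(jacobi_steps):
        w = jacobi_step(A, b, w)
    return w

# Toy horizontally partitioned data generated from y = 2*x1 + 3*x2 (no noise).
rows_a, y_a = [[1, 0], [0, 1], [2, 1]], [2, 3, 7]   # held by user A
rows_b, y_b = [[1, 2], [3, 0], [0, 2]], [8, 6, 6]   # held by user B

gram_a, moment_a = local_stats(rows_a, y_a)
gram_b, moment_b = local_stats(rows_b, y_b)

# With a horizontal split, the global statistics are the sums of the local ones.
gram = [[ga + gb for ga, gb in zip(ra, rb)] for ra, rb in zip(gram_a, gram_b)]
moment = [ma + mb for ma, mb in zip(moment_a, moment_b)]

weights = hybrid_solve(gram, moment)
print(weights)
```

In this sketch the two users exchange only the aggregated statistics X^T X and X^T y, never raw rows; a privacy-preserving scheme would additionally have to protect those aggregates, which is the part the plaintext version leaves out.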