Privacy-preserving Linear Regression Scheme and Its Application
Linear regression is an important and widely used machine learning algorithm.The training of linear regression model usually depends on a large amount of data.In reality,the data set is generally held by different users and contains their privacy information.When multiple users want to gather more d...
Main Author: | |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial office of Computer Science
2022-09-01
|
Series: | Jisuanji kexue |
Subjects: | |
Online Access: | https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-9-318.pdf |
_version_ | 1797845128394571776 |
---|---|
author | LYU You, WU Wen-yuan |
author_facet | LYU You, WU Wen-yuan |
author_sort | LYU You, WU Wen-yuan |
collection | DOAJ |
description | Linear regression is an important and widely used machine learning algorithm.The training of linear regression model usually depends on a large amount of data.In reality,the data set is generally held by different users and contains their privacy information.When multiple users want to gather more data to train a better model,it inevitably involves users' privacy.As a privacy protection technology,homomorphic encryption can effectively solve the problem of privacy leakage in computing.A new privacy preserving linear regression scheme based on hybrid iterative method is designed for the scenario where data sets are distri-buted horizontally on two users.The scheme is divided into two stages.The first stage implements the statistic gradient descent algorithm in the ciphertext domain.In the second stage,a secure two-party fast descent protocol is designed.The core idea of the protocol is based on Jacobi iterative method,which can effectively make up for the poor convergence effect of gradient descent method in practical application,accelerate the convergence of the model,and protect the data privacy of two users while effectively training the linear regression model.The efficiency,communication loss and security of the scheme are analyzed.The scheme is implemented by using C++and applied to real data sets.A large number of experimental results show that the scheme can effectively solve the linear regression problem with large scale features.The relative error of decision coefficient is less than 0.001,which show that the application effect of the privacy preserving linear regression model in real data set is close to that obtained directly from unencrypted data,and the scheme can meet the practical application requirements in specific scenarios. |
first_indexed | 2024-04-09T17:33:34Z |
format | Article |
id | doaj.art-60705028bb494696a3b53075a18630e8 |
institution | Directory Open Access Journal |
issn | 1002-137X |
language | zho |
last_indexed | 2024-04-09T17:33:34Z |
publishDate | 2022-09-01 |
publisher | Editorial office of Computer Science |
record_format | Article |
series | Jisuanji kexue |
spelling | doaj.art-60705028bb494696a3b53075a18630e82023-04-18T02:32:31ZzhoEditorial office of Computer ScienceJisuanji kexue1002-137X2022-09-0149931832510.11896/jsjkx.220300190Privacy-preserving Linear Regression Scheme and Its ApplicationLYU You, WU Wen-yuan01 Chongqing Institute of Green and Intelligent Technology,Chinese Academy of Sciences,Chongqing 400714,China ;2 University of Chinese Academy Sciences,Beijing 100049,ChinaLinear regression is an important and widely used machine learning algorithm.The training of linear regression model usually depends on a large amount of data.In reality,the data set is generally held by different users and contains their privacy information.When multiple users want to gather more data to train a better model,it inevitably involves users' privacy.As a privacy protection technology,homomorphic encryption can effectively solve the problem of privacy leakage in computing.A new privacy preserving linear regression scheme based on hybrid iterative method is designed for the scenario where data sets are distri-buted horizontally on two users.The scheme is divided into two stages.The first stage implements the statistic gradient descent algorithm in the ciphertext domain.In the second stage,a secure two-party fast descent protocol is designed.The core idea of the protocol is based on Jacobi iterative method,which can effectively make up for the poor convergence effect of gradient descent method in practical application,accelerate the convergence of the model,and protect the data privacy of two users while effectively training the linear regression model.The efficiency,communication loss and security of the scheme are analyzed.The scheme is implemented by using C++and applied to real data sets.A large number of experimental results show that the scheme can effectively solve the linear regression problem with large scale features.The relative error of decision coefficient is less than 0.001,which show that the application effect of the privacy preserving linear regression model in real data set is close to that obtained directly from unencrypted data,and the scheme can meet the practical application requirements in specific scenarios.https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-9-318.pdfprivacy-preserving|linear regression|hybrid iterative method|homomorphic encryption |
spellingShingle | LYU You, WU Wen-yuan Privacy-preserving Linear Regression Scheme and Its Application Jisuanji kexue privacy-preserving|linear regression|hybrid iterative method|homomorphic encryption |
title | Privacy-preserving Linear Regression Scheme and Its Application |
title_full | Privacy-preserving Linear Regression Scheme and Its Application |
title_fullStr | Privacy-preserving Linear Regression Scheme and Its Application |
title_full_unstemmed | Privacy-preserving Linear Regression Scheme and Its Application |
title_short | Privacy-preserving Linear Regression Scheme and Its Application |
title_sort | privacy preserving linear regression scheme and its application |
topic | privacy-preserving|linear regression|hybrid iterative method|homomorphic encryption |
url | https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-9-318.pdf |
work_keys_str_mv | AT lyuyouwuwenyuan privacypreservinglinearregressionschemeanditsapplication |