New Efficient Approach to Solve Big Data Systems Using Parallel Gauss–Seidel Algorithms
In order to perform big-data analytics, regression involving large matrices is often necessary. In particular, large scale regression problems are encountered when one wishes to extract semantic patterns for knowledge discovery and data mining. When a large matrix can be processed in its factorized...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-04-01
|
Series: | Big Data and Cognitive Computing |
Subjects: | |
Online Access: | https://www.mdpi.com/2504-2289/6/2/43 |
_version_ | 1797490035139805184 |
---|---|
author | Shih Yu Chang Hsiao-Chun Wu Yifan Wang |
author_facet | Shih Yu Chang Hsiao-Chun Wu Yifan Wang |
author_sort | Shih Yu Chang |
collection | DOAJ |
description | In order to perform big-data analytics, regression involving large matrices is often necessary. In particular, large scale regression problems are encountered when one wishes to extract semantic patterns for knowledge discovery and data mining. When a large matrix can be processed in its factorized form, advantages arise in terms of computation, implementation, and data-compression. In this work, we propose two new parallel iterative algorithms as extensions of the Gauss–Seidel algorithm (GSA) to solve regression problems involving many variables. The convergence study in terms of error-bounds of the proposed iterative algorithms is also performed, and the required computation resources, namely time- and memory-complexities, are evaluated to benchmark the efficiency of the proposed new algorithms. Finally, the numerical results from both Monte Carlo simulations and real-world datasets are presented to demonstrate the striking effectiveness of our proposed new methods. |
first_indexed | 2024-03-10T00:25:13Z |
format | Article |
id | doaj.art-ed4b4dabcf884e2895a68aa37320eb76 |
institution | Directory Open Access Journal |
issn | 2504-2289 |
language | English |
last_indexed | 2024-03-10T00:25:13Z |
publishDate | 2022-04-01 |
publisher | MDPI AG |
record_format | Article |
series | Big Data and Cognitive Computing |
spelling | doaj.art-ed4b4dabcf884e2895a68aa37320eb762023-11-23T15:36:10ZengMDPI AGBig Data and Cognitive Computing2504-22892022-04-01624310.3390/bdcc6020043New Efficient Approach to Solve Big Data Systems Using Parallel Gauss–Seidel AlgorithmsShih Yu Chang0Hsiao-Chun Wu1Yifan Wang2The Department of Applied Data Science, San Jose State University, San Jose, CA 95192, USAThe School of Electrical Engineering and Computer Science, Louisiana State University, Baton Rouge, LA 70803, USAThe Department of Computer Science, California University, Santa Clara, CA 95054, USAIn order to perform big-data analytics, regression involving large matrices is often necessary. In particular, large scale regression problems are encountered when one wishes to extract semantic patterns for knowledge discovery and data mining. When a large matrix can be processed in its factorized form, advantages arise in terms of computation, implementation, and data-compression. In this work, we propose two new parallel iterative algorithms as extensions of the Gauss–Seidel algorithm (GSA) to solve regression problems involving many variables. The convergence study in terms of error-bounds of the proposed iterative algorithms is also performed, and the required computation resources, namely time- and memory-complexities, are evaluated to benchmark the efficiency of the proposed new algorithms. Finally, the numerical results from both Monte Carlo simulations and real-world datasets are presented to demonstrate the striking effectiveness of our proposed new methods.https://www.mdpi.com/2504-2289/6/2/43Gauss–Seidel algorithmrandom iterationsmatrix factorizationlinear systemsbig data |
spellingShingle | Shih Yu Chang Hsiao-Chun Wu Yifan Wang New Efficient Approach to Solve Big Data Systems Using Parallel Gauss–Seidel Algorithms Big Data and Cognitive Computing Gauss–Seidel algorithm random iterations matrix factorization linear systems big data |
title | New Efficient Approach to Solve Big Data Systems Using Parallel Gauss–Seidel Algorithms |
title_full | New Efficient Approach to Solve Big Data Systems Using Parallel Gauss–Seidel Algorithms |
title_fullStr | New Efficient Approach to Solve Big Data Systems Using Parallel Gauss–Seidel Algorithms |
title_full_unstemmed | New Efficient Approach to Solve Big Data Systems Using Parallel Gauss–Seidel Algorithms |
title_short | New Efficient Approach to Solve Big Data Systems Using Parallel Gauss–Seidel Algorithms |
title_sort | new efficient approach to solve big data systems using parallel gauss seidel algorithms |
topic | Gauss–Seidel algorithm random iterations matrix factorization linear systems big data |
url | https://www.mdpi.com/2504-2289/6/2/43 |
work_keys_str_mv | AT shihyuchang newefficientapproachtosolvebigdatasystemsusingparallelgaussseidelalgorithms AT hsiaochunwu newefficientapproachtosolvebigdatasystemsusingparallelgaussseidelalgorithms AT yifanwang newefficientapproachtosolvebigdatasystemsusingparallelgaussseidelalgorithms |