High-performance solutions of geographically weighted regression in R

ABSTRACTAs an established spatial analytical tool, Geographically Weighted Regression (GWR) has been applied across a variety of disciplines. However, its usage can be challenging for large datasets, which are increasingly prevalent in today’s digital world. In this study, we propose two high-perfor...

Full description

Bibliographic Details
Main Authors: Binbin Lu, Yigong Hu, Daisuke Murakami, Chris Brunsdon, Alexis Comber, Martin Charlton, Paul Harris
Format: Article
Language:English
Published: Taylor & Francis Group 2022-10-01
Series:Geo-spatial Information Science
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/10095020.2022.2064244
_version_ 1811205507590914048
author Binbin Lu
Yigong Hu
Daisuke Murakami
Chris Brunsdon
Alexis Comber
Martin Charlton
Paul Harris
author_facet Binbin Lu
Yigong Hu
Daisuke Murakami
Chris Brunsdon
Alexis Comber
Martin Charlton
Paul Harris
author_sort Binbin Lu
collection DOAJ
description ABSTRACTAs an established spatial analytical tool, Geographically Weighted Regression (GWR) has been applied across a variety of disciplines. However, its usage can be challenging for large datasets, which are increasingly prevalent in today’s digital world. In this study, we propose two high-performance R solutions for GWR via Multi-core Parallel (MP) and Compute Unified Device Architecture (CUDA) techniques, respectively GWR-MP and GWR-CUDA. We compared GWR-MP and GWR-CUDA with three existing solutions available in Geographically Weighted Models (GWmodel), Multi-scale GWR (MGWR) and Fast GWR (FastGWR). Results showed that all five solutions perform differently across varying sample sizes, with no single solution a clear winner in terms of computational efficiency. Specifically, solutions given in GWmodel and MGWR provided acceptable computational costs for GWR studies with a relatively small sample size. For a large sample size, GWR-MP and FastGWR provided coherent solutions on a Personal Computer (PC) with a common multi-core configuration, GWR-MP provided more efficient computing capacity for each core or thread than FastGWR. For cases when the sample size was very large, and for these cases only, GWR-CUDA provided the most efficient solution, but should note its I/O cost with small samples. In summary, GWR-MP and GWR-CUDA provided complementary high-performance R solutions to existing ones, where for certain data-rich GWR studies, they should be preferred.
first_indexed 2024-04-12T03:32:48Z
format Article
id doaj.art-ef6fab6a9cd448b7a8c0440be9dce91a
institution Directory Open Access Journal
issn 1009-5020
1993-5153
language English
last_indexed 2024-04-12T03:32:48Z
publishDate 2022-10-01
publisher Taylor & Francis Group
record_format Article
series Geo-spatial Information Science
spelling doaj.art-ef6fab6a9cd448b7a8c0440be9dce91a2022-12-22T03:49:30ZengTaylor & Francis GroupGeo-spatial Information Science1009-50201993-51532022-10-0125453654910.1080/10095020.2022.2064244High-performance solutions of geographically weighted regression in RBinbin Lu0Yigong Hu1Daisuke Murakami2Chris Brunsdon3Alexis Comber4Martin Charlton5Paul Harris6School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaSchool of Remote Sensing and Information Engineering, Wuhan University, Wuhan, ChinaDepartment of Data Science, Institute of Mathematical Statistics, Tokyo, JapanNational Centre for Geocomputation, Maynooth University, Maynooth, IrelandSchool of Geography, University of Leeds, Leeds, UKNational Centre for Geocomputation, Maynooth University, Maynooth, IrelandSustainable Agriculture Sciences North Wyke, Rothamsted Research, Okehampton, UKABSTRACTAs an established spatial analytical tool, Geographically Weighted Regression (GWR) has been applied across a variety of disciplines. However, its usage can be challenging for large datasets, which are increasingly prevalent in today’s digital world. In this study, we propose two high-performance R solutions for GWR via Multi-core Parallel (MP) and Compute Unified Device Architecture (CUDA) techniques, respectively GWR-MP and GWR-CUDA. We compared GWR-MP and GWR-CUDA with three existing solutions available in Geographically Weighted Models (GWmodel), Multi-scale GWR (MGWR) and Fast GWR (FastGWR). Results showed that all five solutions perform differently across varying sample sizes, with no single solution a clear winner in terms of computational efficiency. Specifically, solutions given in GWmodel and MGWR provided acceptable computational costs for GWR studies with a relatively small sample size. For a large sample size, GWR-MP and FastGWR provided coherent solutions on a Personal Computer (PC) with a common multi-core configuration, GWR-MP provided more efficient computing capacity for each core or thread than FastGWR. For cases when the sample size was very large, and for these cases only, GWR-CUDA provided the most efficient solution, but should note its I/O cost with small samples. In summary, GWR-MP and GWR-CUDA provided complementary high-performance R solutions to existing ones, where for certain data-rich GWR studies, they should be preferred.https://www.tandfonline.com/doi/10.1080/10095020.2022.2064244Non-stationaritybig dataparallel computingCompute Unified Device Architecture (CUDA)Geographically Weighted models (GWmodel)
spellingShingle Binbin Lu
Yigong Hu
Daisuke Murakami
Chris Brunsdon
Alexis Comber
Martin Charlton
Paul Harris
High-performance solutions of geographically weighted regression in R
Geo-spatial Information Science
Non-stationarity
big data
parallel computing
Compute Unified Device Architecture (CUDA)
Geographically Weighted models (GWmodel)
title High-performance solutions of geographically weighted regression in R
title_full High-performance solutions of geographically weighted regression in R
title_fullStr High-performance solutions of geographically weighted regression in R
title_full_unstemmed High-performance solutions of geographically weighted regression in R
title_short High-performance solutions of geographically weighted regression in R
title_sort high performance solutions of geographically weighted regression in r
topic Non-stationarity
big data
parallel computing
Compute Unified Device Architecture (CUDA)
Geographically Weighted models (GWmodel)
url https://www.tandfonline.com/doi/10.1080/10095020.2022.2064244
work_keys_str_mv AT binbinlu highperformancesolutionsofgeographicallyweightedregressioninr
AT yigonghu highperformancesolutionsofgeographicallyweightedregressioninr
AT daisukemurakami highperformancesolutionsofgeographicallyweightedregressioninr
AT chrisbrunsdon highperformancesolutionsofgeographicallyweightedregressioninr
AT alexiscomber highperformancesolutionsofgeographicallyweightedregressioninr
AT martincharlton highperformancesolutionsofgeographicallyweightedregressioninr
AT paulharris highperformancesolutionsofgeographicallyweightedregressioninr