A simulated ‘sandbox’ for exploring the modifiable areal unit problem in aggregation and disaggregation

Abstract: We present a spatial testbed of simulated boundary data based on a set of very high-resolution census-based areal units surrounding Guadalajara, Mexico. From these input areal units, we simulated 10 levels of spatial resolution, ranging from 5,515 to 52,388 units per level, with 100 simulated...

Bibliographic Details
Main Authors: Jeremiah J. Nieves, Andrea E. Gaughan, Forrest R. Stevens, Greg Yetman, Andreas Gros
Format: Article
Language: English
Published: Nature Portfolio 2024-02-01
Series: Scientific Data
Online Access: https://doi.org/10.1038/s41597-024-03061-1
collection DOAJ
description Abstract: We present a spatial testbed of simulated boundary data based on a set of very high-resolution census-based areal units surrounding Guadalajara, Mexico. From these input areal units, we simulated 10 levels of spatial resolution, ranging from 5,515 to 52,388 units per level, with 100 simulated zonal configurations for each level, totalling 1,000 simulated sets of areal units. These data make it possible to interrogate various realizations of the data and the effects of spatial coarseness and zonal configuration, i.e. the Modifiable Areal Unit Problem (MAUP), on applications such as model training, model prediction, disaggregation, and aggregation. Further, these data can support the production of spatially explicit, non-parametric confidence interval estimates via bootstrapping. We provide a pre-processed version of these 1,000 simulated sets of areal units, metadata and summary data to assist in their use, and a code notebook with the means to alter or reproduce these data.
first_indexed 2024-03-07T15:20:50Z
id doaj.art-47adb82e25464bdab2ec1d4791a0cb44
institution Directory of Open Access Journals
issn 2052-4463
last_indexed 2024-03-07T15:20:50Z
spelling doaj.art-47adb82e25464bdab2ec1d4791a0cb44; 2024-03-05T17:39:31Z; eng; Nature Portfolio; Scientific Data; 2052-4463; 2024-02-01; 111110; 10.1038/s41597-024-03061-1
Jeremiah J. Nieves (University of Glasgow, School of Geographical & Earth Sciences)
Andrea E. Gaughan (University of Louisville, Dept. of Geographic and Environmental Sciences)
Forrest R. Stevens (University of Louisville, Dept. of Geographic and Environmental Sciences)
Greg Yetman (Center for International Earth Science Information Network (CIESIN), University of Columbia)
Andreas Gros (Vibrant Planet PBC)