Village-level poverty identification using machine learning, high-resolution images, and geospatial data

Tracking progress in poverty alleviation and promptly identifying the distribution of poor areas are critical for strategic policy interventions, especially for regions with poor statistical systems. The massive satellite imagery and geospatial data provide great opportunities for timely and cost-ef...

Full description

Bibliographic Details
Main Authors: Shan Hu, Yong Ge, Mengxiao Liu, Zhoupeng Ren, Xining Zhang
Format: Article
Language:English
Published: Elsevier 2022-03-01
Series:International Journal of Applied Earth Observations and Geoinformation
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S0303243422000204
_version_ 1797961223710441472
author Shan Hu
Yong Ge
Mengxiao Liu
Zhoupeng Ren
Xining Zhang
author_facet Shan Hu
Yong Ge
Mengxiao Liu
Zhoupeng Ren
Xining Zhang
author_sort Shan Hu
collection DOAJ
description Tracking progress in poverty alleviation and promptly identifying the distribution of poor areas are critical for strategic policy interventions, especially for regions with poor statistical systems. The massive satellite imagery and geospatial data provide great opportunities for timely and cost-effective socioeconomic evaluations. However, existing research on poverty identification is mostly based on satellite images, and the potential of combined multi-source geospatial data on poverty identification has not been fully explored. Here, we propose an approach that evaluates how village-level poverty can be identified by integrating high-resolution imagery (HRI), point-of-interest (POI), OpenStreetMap (OSM), and digital surface model (DSM) data. The study area included 338 villages from Yunyang County, located in Hubei Province, central China. We extracted the explanatory variables indicating access to facilities and services, agricultural production conditions, village construction, and the spatial distribution of village settlements from the HRI, POI, OSM, and DSM data. The random forest algorithm was then used to model the relationship between village-level poverty and explanatory variables. The results demonstrated a 54% accuracy in the prediction of village-level poverty; the best prediction performance (72%) was observed for the villages categorized as poor. The built-up land proportion and the time cost to the facilities and services contributed the most to the identification of village-level poverty, while the proxy variables of agricultural production conditions contributed the least. This study provides an approach to village-level poverty identification using satellite imagery and geospatial data and proves that the data employed in this study could identify the poorest areas that are highly coupled with natural geographical conditions and backward public services.
first_indexed 2024-04-11T00:56:46Z
format Article
id doaj.art-a84f2f0fa80344b288c1f27e23265694
institution Directory Open Access Journal
issn 1569-8432
language English
last_indexed 2024-04-11T00:56:46Z
publishDate 2022-03-01
publisher Elsevier
record_format Article
series International Journal of Applied Earth Observations and Geoinformation
spelling doaj.art-a84f2f0fa80344b288c1f27e232656942023-01-05T04:31:05ZengElsevierInternational Journal of Applied Earth Observations and Geoinformation1569-84322022-03-01107102694Village-level poverty identification using machine learning, high-resolution images, and geospatial dataShan Hu0Yong Ge1Mengxiao Liu2Zhoupeng Ren3Xining Zhang4Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, ChinaInstitute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China; Corresponding author at: Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China.Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, ChinaInstitute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, ChinaInstitute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, ChinaTracking progress in poverty alleviation and promptly identifying the distribution of poor areas are critical for strategic policy interventions, especially for regions with poor statistical systems. The massive satellite imagery and geospatial data provide great opportunities for timely and cost-effective socioeconomic evaluations. However, existing research on poverty identification is mostly based on satellite images, and the potential of combined multi-source geospatial data on poverty identification has not been fully explored. Here, we propose an approach that evaluates how village-level poverty can be identified by integrating high-resolution imagery (HRI), point-of-interest (POI), OpenStreetMap (OSM), and digital surface model (DSM) data. The study area included 338 villages from Yunyang County, located in Hubei Province, central China. We extracted the explanatory variables indicating access to facilities and services, agricultural production conditions, village construction, and the spatial distribution of village settlements from the HRI, POI, OSM, and DSM data. The random forest algorithm was then used to model the relationship between village-level poverty and explanatory variables. The results demonstrated a 54% accuracy in the prediction of village-level poverty; the best prediction performance (72%) was observed for the villages categorized as poor. The built-up land proportion and the time cost to the facilities and services contributed the most to the identification of village-level poverty, while the proxy variables of agricultural production conditions contributed the least. This study provides an approach to village-level poverty identification using satellite imagery and geospatial data and proves that the data employed in this study could identify the poorest areas that are highly coupled with natural geographical conditions and backward public services.http://www.sciencedirect.com/science/article/pii/S0303243422000204PovertyHigh-resolution imageryGeospatial dataPoint-of-interest (POI)OpenStreetMap (OSM)Random forest
spellingShingle Shan Hu
Yong Ge
Mengxiao Liu
Zhoupeng Ren
Xining Zhang
Village-level poverty identification using machine learning, high-resolution images, and geospatial data
International Journal of Applied Earth Observations and Geoinformation
Poverty
High-resolution imagery
Geospatial data
Point-of-interest (POI)
OpenStreetMap (OSM)
Random forest
title Village-level poverty identification using machine learning, high-resolution images, and geospatial data
title_full Village-level poverty identification using machine learning, high-resolution images, and geospatial data
title_fullStr Village-level poverty identification using machine learning, high-resolution images, and geospatial data
title_full_unstemmed Village-level poverty identification using machine learning, high-resolution images, and geospatial data
title_short Village-level poverty identification using machine learning, high-resolution images, and geospatial data
title_sort village level poverty identification using machine learning high resolution images and geospatial data
topic Poverty
High-resolution imagery
Geospatial data
Point-of-interest (POI)
OpenStreetMap (OSM)
Random forest
url http://www.sciencedirect.com/science/article/pii/S0303243422000204
work_keys_str_mv AT shanhu villagelevelpovertyidentificationusingmachinelearninghighresolutionimagesandgeospatialdata
AT yongge villagelevelpovertyidentificationusingmachinelearninghighresolutionimagesandgeospatialdata
AT mengxiaoliu villagelevelpovertyidentificationusingmachinelearninghighresolutionimagesandgeospatialdata
AT zhoupengren villagelevelpovertyidentificationusingmachinelearninghighresolutionimagesandgeospatialdata
AT xiningzhang villagelevelpovertyidentificationusingmachinelearninghighresolutionimagesandgeospatialdata