Globe230k: A Benchmark Dense-Pixel Annotation Dataset for Global Land Cover Mapping

Global land cover maps provide fundamental information for understanding the relationship between global environmental change and human settlement. With the development of data-driven deep learning, semantic segmentation networks have greatly facilitated global land cover mapping...

Bibliographic Details
Main Authors: Qian Shi, Da He, Zhengyu Liu, Xiaoping Liu, Jingqian Xue
Format: Article
Language: English
Published: American Association for the Advancement of Science (AAAS) 2023-01-01
Series: Journal of Remote Sensing
Online Access: https://spj.science.org/doi/10.34133/remotesensing.0078
collection DOAJ
description Global land cover maps provide fundamental information for understanding the relationship between global environmental change and human settlement. With the development of data-driven deep learning, semantic segmentation networks have greatly facilitated global land cover mapping. However, the performance of a semantic segmentation network depends closely on the quantity and quality of its training data. Existing annotation data are usually insufficient in quantity, quality, and spatial resolution, and are usually sampled from local regions, so they lack diversity and variability, which makes data-driven models difficult to extend to the global scale. We therefore propose a large-scale annotation dataset (Globe230k) for semantic segmentation of remote sensing images, which has three advantages: (a) large scale: the Globe230k dataset includes 232,819 annotated images with a size of 512 × 512 pixels and a spatial resolution of 1 m, labeled with 10 first-level categories; (b) rich diversity: the annotated images are sampled from regions worldwide, covering an area of over 60,000 km², which indicates high variability and diversity; (c) multimodal: the Globe230k dataset contains not only RGB bands but also other important features for Earth system research, such as the normalized difference vegetation index (NDVI), digital elevation model (DEM), vertical–vertical polarization (VV) bands, and vertical–horizontal polarization (VH) bands, which can facilitate multimodal data fusion research. We used the Globe230k dataset to test several state-of-the-art semantic segmentation algorithms and found that it can evaluate algorithms in multiple aspects that are crucial for characterizing land cover, including multiscale modeling, detail reconstruction, and generalization ability. The dataset has been made public and can be used as a benchmark to promote further development of global land cover mapping and semantic segmentation algorithms.
first_indexed 2024-03-11T18:11:22Z
id doaj.art-94687dee8ad34d089499240575f893fb
institution Directory of Open Access Journals
issn 2694-1589
last_indexed 2024-03-11T18:11:22Z
affiliation School of Geography and Planning, Sun Yat-sen University, Guangzhou 510275, China (all five authors)
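
The figures quoted in the description can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the dataset's tooling: it assumes the tiles are square and non-overlapping, and the function names coverage_km2 and ndvi are hypothetical. Only the numeric constants come from the abstract. The NDVI helper uses the standard definition (NIR - Red) / (NIR + Red); the record does not say how the dataset's NDVI band was actually produced.

```python
# Hypothetical helper names; only the numeric constants come from the abstract.
NUM_IMAGES = 232_819   # annotated images reported in the abstract
TILE_SIDE_PX = 512     # tile width and height in pixels
RESOLUTION_M = 1.0     # ground sampling distance in metres per pixel


def coverage_km2(num_images: int, side_px: int, resolution_m: float) -> float:
    """Total ground area in km^2, assuming square, non-overlapping tiles."""
    side_km = side_px * resolution_m / 1000.0
    return num_images * side_km * side_km


def ndvi(nir: float, red: float) -> float:
    """Standard NDVI = (NIR - Red) / (NIR + Red); returns 0.0 if both are zero."""
    total = nir + red
    return 0.0 if total == 0 else (nir - red) / total


if __name__ == "__main__":
    area = coverage_km2(NUM_IMAGES, TILE_SIDE_PX, RESOLUTION_M)
    print(f"Estimated coverage: {area:,.1f} km^2")
    # Illustrative reflectance values, not taken from the dataset.
    print(f"Example NDVI: {ndvi(0.5, 0.1):.3f}")
```

Under these assumptions the estimate comes to roughly 61,000 km², which is consistent with the "over 60,000 km²" stated in the description.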