Integrating Crowdsourced Data with a Land Cover Product: A Bayesian Data Fusion Approach

For many environmental applications, an accurate spatial mapping of land cover is a major concern. Currently, land cover products derived from satellite data are expected to offer a fast and inexpensive way of mapping large areas. However, the quality of these products may also largely depend on the...

Full description

Bibliographic Details
Main Authors: Sarah Gengler, Patrick Bogaert
Format: Article
Language:English
Published: MDPI AG 2016-06-01
Series:Remote Sensing
Subjects:
Online Access:http://www.mdpi.com/2072-4292/8/7/545
_version_ 1798024484770283520
author Sarah Gengler
Patrick Bogaert
author_facet Sarah Gengler
Patrick Bogaert
author_sort Sarah Gengler
collection DOAJ
description For many environmental applications, an accurate spatial mapping of land cover is a major concern. Currently, land cover products derived from satellite data are expected to offer a fast and inexpensive way of mapping large areas. However, the quality of these products may also largely depend on the area under study. As a result, it is common that various products disagree with each other, and the assessment of their respective quality still relies on ground validation datasets. Recently, crowdsourced data have been suggested as an alternate source of information that might help overcome this problem. However, crowdsourced data still remain largely discarded in scientific studies due to their inherent poor quality assurance. The aim of this paper is to present an efficient methodology that allows the user to code information brought by crowdsourced data even if no prior quality estimation is at hand and possibly to fuse this information with existing land cover products in order to improve their accuracy. It is first suggested that information brought by volunteers can be coded as a set of inequality constraints about the probabilities of the various land use classes at the visited places. This in turn allows estimating optimal probabilities based on a maximum entropy principle and to proceed afterwards with a spatial interpolation of these volunteers’ information. Finally, a Bayesian data fusion approach can be used for fusing multiple volunteers’ contributions with a remotely-sensed land cover product. This methodology is illustrated in this paper by focusing on the mapping of croplands in Ethiopia, where the aim is to improve the mapping of cropland as coming out from a land cover product with mitigated performances. It is shown how crowdsourced information can seriously improve the quality of the final product. The corresponding results also suggest that a prior assessing of remotely-sensed data quality can seriously improve the benefit of crowdsourcing campaigns, so that both sources of information need to be accounted together in order to optimize the sampling efforts.
first_indexed 2024-04-11T18:03:10Z
format Article
id doaj.art-8c3a75c4883d441595794336131d6d5d
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-04-11T18:03:10Z
publishDate 2016-06-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-8c3a75c4883d441595794336131d6d5d2022-12-22T04:10:23ZengMDPI AGRemote Sensing2072-42922016-06-018754510.3390/rs8070545rs8070545Integrating Crowdsourced Data with a Land Cover Product: A Bayesian Data Fusion ApproachSarah Gengler0Patrick Bogaert1Earth and Life Institute, Environmental Sciences, Université Catholique de Louvain, Croix du Sud 2/L7.05.16, B-1348 Louvain-la-Neuve, BelgiumEarth and Life Institute, Environmental Sciences, Université Catholique de Louvain, Croix du Sud 2/L7.05.16, B-1348 Louvain-la-Neuve, BelgiumFor many environmental applications, an accurate spatial mapping of land cover is a major concern. Currently, land cover products derived from satellite data are expected to offer a fast and inexpensive way of mapping large areas. However, the quality of these products may also largely depend on the area under study. As a result, it is common that various products disagree with each other, and the assessment of their respective quality still relies on ground validation datasets. Recently, crowdsourced data have been suggested as an alternate source of information that might help overcome this problem. However, crowdsourced data still remain largely discarded in scientific studies due to their inherent poor quality assurance. The aim of this paper is to present an efficient methodology that allows the user to code information brought by crowdsourced data even if no prior quality estimation is at hand and possibly to fuse this information with existing land cover products in order to improve their accuracy. It is first suggested that information brought by volunteers can be coded as a set of inequality constraints about the probabilities of the various land use classes at the visited places. This in turn allows estimating optimal probabilities based on a maximum entropy principle and to proceed afterwards with a spatial interpolation of these volunteers’ information. Finally, a Bayesian data fusion approach can be used for fusing multiple volunteers’ contributions with a remotely-sensed land cover product. This methodology is illustrated in this paper by focusing on the mapping of croplands in Ethiopia, where the aim is to improve the mapping of cropland as coming out from a land cover product with mitigated performances. It is shown how crowdsourced information can seriously improve the quality of the final product. The corresponding results also suggest that a prior assessing of remotely-sensed data quality can seriously improve the benefit of crowdsourcing campaigns, so that both sources of information need to be accounted together in order to optimize the sampling efforts.http://www.mdpi.com/2072-4292/8/7/545crowdsourcingland cover productsBayesian data fusionmaximum entropyEthiopia
spellingShingle Sarah Gengler
Patrick Bogaert
Integrating Crowdsourced Data with a Land Cover Product: A Bayesian Data Fusion Approach
Remote Sensing
crowdsourcing
land cover products
Bayesian data fusion
maximum entropy
Ethiopia
title Integrating Crowdsourced Data with a Land Cover Product: A Bayesian Data Fusion Approach
title_full Integrating Crowdsourced Data with a Land Cover Product: A Bayesian Data Fusion Approach
title_fullStr Integrating Crowdsourced Data with a Land Cover Product: A Bayesian Data Fusion Approach
title_full_unstemmed Integrating Crowdsourced Data with a Land Cover Product: A Bayesian Data Fusion Approach
title_short Integrating Crowdsourced Data with a Land Cover Product: A Bayesian Data Fusion Approach
title_sort integrating crowdsourced data with a land cover product a bayesian data fusion approach
topic crowdsourcing
land cover products
Bayesian data fusion
maximum entropy
Ethiopia
url http://www.mdpi.com/2072-4292/8/7/545
work_keys_str_mv AT sarahgengler integratingcrowdsourceddatawithalandcoverproductabayesiandatafusionapproach
AT patrickbogaert integratingcrowdsourceddatawithalandcoverproductabayesiandatafusionapproach