Comparison of common classification strategies for large-scale vegetation mapping over the Google Earth Engine platform

Vegetation resources have an essential role in sustainable development due to their close relationship with natural resource management and environmental protection. The monitoring of land use and cover is key for a more sustainable management of these resources, and Earth Observation satellites hav...

Full description

Bibliographic Details
Main Authors: Tomás Marín Del Valle, Ping Jiang
Format: Article
Language:English
Published: Elsevier 2022-12-01
Series:International Journal of Applied Earth Observations and Geoinformation
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1569843222002801
Description
Summary:Vegetation resources have an essential role in sustainable development due to their close relationship with natural resource management and environmental protection. The monitoring of land use and cover is key for a more sustainable management of these resources, and Earth Observation satellites have provided an increasingly powerful platform for performing this task. To date, numerous classification algorithms have been developed for vegetation mapping, but a comparative evaluation of the strategies commonly used in large-scale applications (50000 km2 and above) is lacking. We developed a classification framework based on random forests and Sentinel data within the Google Earth Engine platform to assess the performance of various strategies over a complex landscape with a wide range of vegetation classes, plot configurations, and agricultural practices. These strategies differed in key aspects related to the characteristics of the classifier and the sample, and were evaluated by class-specific and general accuracy metrics. We found that the use of pixel time series features from fusion data, class-balanced labels, and multi-season time frames enhanced overall performance by 1.3%–14.1% over alternative approaches, but in some cases also generated tradeoffs of 1.6%–8.5% between recall and precision. In general, suboptimal strategies were particularly ineffective for the detection of infrequent classes. Finally, the different parameter values used in the random forests did not have a significant influence over the results. Our results demonstrate the importance of algorithm design in the effective classification of multiple vegetation classes, corroborate the usefulness of Sentinel data to generate mapping products of high resolution and accuracy, and highlight the importance of cloud computing tools for the development of vegetation mapping tools in large-scale applications. These findings provide general guidelines for the design of future classification frameworks, which are necessary for facilitating a sustainable management of natural resources worldwide.
ISSN:1569-8432