Exploring Machine Learning to Correct Satellite-Derived Sea Surface Temperatures

Machine learning techniques are attractive tools to establish statistical models with a high degree of non linearity. They require a large amount of data to be trained and are therefore particularly suited to analysing remote sensing data. This work is an attempt at using advanced statistical method...

Full description

Bibliographic Details
Main Authors: Stéphane Saux Picart, Pierre Tandeo, Emmanuelle Autret, Blandine Gausset
Format: Article
Language:English
Published: MDPI AG 2018-02-01
Series:Remote Sensing
Subjects:
Online Access:http://www.mdpi.com/2072-4292/10/2/224
_version_ 1818870363661008896
author Stéphane Saux Picart
Pierre Tandeo
Emmanuelle Autret
Blandine Gausset
author_facet Stéphane Saux Picart
Pierre Tandeo
Emmanuelle Autret
Blandine Gausset
author_sort Stéphane Saux Picart
collection DOAJ
description Machine learning techniques are attractive tools to establish statistical models with a high degree of non linearity. They require a large amount of data to be trained and are therefore particularly suited to analysing remote sensing data. This work is an attempt at using advanced statistical methods of machine learning to predict the bias between Sea Surface Temperature (SST) derived from infrared remote sensing and ground “truth” from drifting buoy measurements. A large dataset of collocation between satellite SST and in situ SST is explored. Four regression models are used: Simple multi-linear regression, Least Square Shrinkage and Selection Operator (LASSO), Generalised Additive Model (GAM) and random forest. In the case of geostationary satellites for which a large number of collocations is available, results show that the random forest model is the best model to predict the systematic errors and it is computationally fast, making it a good candidate for operational processing. It is able to explain nearly 31% of the total variance of the bias (in comparison to about 24% for the multi-linear regression model).
first_indexed 2024-12-19T12:05:50Z
format Article
id doaj.art-fb6e1292325a475bbc197527bd2d5051
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-12-19T12:05:50Z
publishDate 2018-02-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-fb6e1292325a475bbc197527bd2d50512022-12-21T20:22:22ZengMDPI AGRemote Sensing2072-42922018-02-0110222410.3390/rs10020224rs10020224Exploring Machine Learning to Correct Satellite-Derived Sea Surface TemperaturesStéphane Saux Picart0Pierre Tandeo1Emmanuelle Autret2Blandine Gausset3Météo-France/Centre de Météorologie Spatiale, Avenue de Lorraine, B.P. 50747, 22307 Lannion CEDEX, FranceIMT Atlantique, Lab-STICC, UBL, 29238 Brest, FranceIfremer, Laboratoire d’Océanographie Physique et Spatiale, ZI Pointe du Diable CS 10070, 29280 Plouzané, FranceMétéo-France/Centre de Météorologie Spatiale, Avenue de Lorraine, B.P. 50747, 22307 Lannion CEDEX, FranceMachine learning techniques are attractive tools to establish statistical models with a high degree of non linearity. They require a large amount of data to be trained and are therefore particularly suited to analysing remote sensing data. This work is an attempt at using advanced statistical methods of machine learning to predict the bias between Sea Surface Temperature (SST) derived from infrared remote sensing and ground “truth” from drifting buoy measurements. A large dataset of collocation between satellite SST and in situ SST is explored. Four regression models are used: Simple multi-linear regression, Least Square Shrinkage and Selection Operator (LASSO), Generalised Additive Model (GAM) and random forest. In the case of geostationary satellites for which a large number of collocations is available, results show that the random forest model is the best model to predict the systematic errors and it is computationally fast, making it a good candidate for operational processing. It is able to explain nearly 31% of the total variance of the bias (in comparison to about 24% for the multi-linear regression model).http://www.mdpi.com/2072-4292/10/2/224machine learningsystematic errorsea surface temperaturerandom forest
spellingShingle Stéphane Saux Picart
Pierre Tandeo
Emmanuelle Autret
Blandine Gausset
Exploring Machine Learning to Correct Satellite-Derived Sea Surface Temperatures
Remote Sensing
machine learning
systematic error
sea surface temperature
random forest
title Exploring Machine Learning to Correct Satellite-Derived Sea Surface Temperatures
title_full Exploring Machine Learning to Correct Satellite-Derived Sea Surface Temperatures
title_fullStr Exploring Machine Learning to Correct Satellite-Derived Sea Surface Temperatures
title_full_unstemmed Exploring Machine Learning to Correct Satellite-Derived Sea Surface Temperatures
title_short Exploring Machine Learning to Correct Satellite-Derived Sea Surface Temperatures
title_sort exploring machine learning to correct satellite derived sea surface temperatures
topic machine learning
systematic error
sea surface temperature
random forest
url http://www.mdpi.com/2072-4292/10/2/224
work_keys_str_mv AT stephanesauxpicart exploringmachinelearningtocorrectsatellitederivedseasurfacetemperatures
AT pierretandeo exploringmachinelearningtocorrectsatellitederivedseasurfacetemperatures
AT emmanuelleautret exploringmachinelearningtocorrectsatellitederivedseasurfacetemperatures
AT blandinegausset exploringmachinelearningtocorrectsatellitederivedseasurfacetemperatures