SeaMAE: Masked Pre-Training with Meteorological Satellite Imagery for Sea Fog Detection

Sea fog detection (SFD) presents a significant challenge in the field of intelligent Earth observation, particularly in analyzing meteorological satellite imagery. Akin to various vision tasks, ImageNet pre-training is commonly used for pre-training SFD. However, in the context of multi-spectral met...

Full description

Bibliographic Details
Main Authors: Haotian Yan, Sundingkai Su, Ming Wu, Mengqiu Xu, Yihao Zuo, Chuang Zhang, Bin Huang
Format: Article
Language:English
Published: MDPI AG 2023-08-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/15/16/4102
_version_ 1827728760943673344
author Haotian Yan
Sundingkai Su
Ming Wu
Mengqiu Xu
Yihao Zuo
Chuang Zhang
Bin Huang
author_facet Haotian Yan
Sundingkai Su
Ming Wu
Mengqiu Xu
Yihao Zuo
Chuang Zhang
Bin Huang
author_sort Haotian Yan
collection DOAJ
description Sea fog detection (SFD) presents a significant challenge in the field of intelligent Earth observation, particularly in analyzing meteorological satellite imagery. Akin to various vision tasks, ImageNet pre-training is commonly used for pre-training SFD. However, in the context of multi-spectral meteorological satellite imagery, the initial step of deep learning has received limited attention. Recently, pre-training with Very High-Resolution (VHR) satellite imagery has gained increased popularity in remote-sensing vision tasks, showing the potential to replace ImageNet pre-training. However, it is worth noting that the meteorological satellite imagery applied in SFD, despite being an application of computer vision in remote sensing, differs greatly from VHR satellite imagery. To address the limitation of pre-training for SFD, this paper introduces a novel deep-learning paradigm to the meteorological domain driven by Masked Image Modeling (MIM). Our research reveals two key insights: (1) Pre-training with meteorological satellite imagery yields superior SFD performance compared to pre-training with nature imagery and VHR satellite imagery. (2) Incorporating the architectural characteristics of SFD models into a vanilla masked autoencoder (MAE) can augment the effectiveness of meteorological pre-training. To facilitate this research, we curate a pre-training dataset comprising 514,655 temporal multi-spectral meteorological satellite images, covering the Bohai Sea and Yellow Sea regions, which have the most sea fog occurrence. The longitude ranges from 115.00E to 128.75E, and the latitude ranges from 27.60N to 41.35N. Moreover, we introduce SeaMAE, a novel MAE that utilizes a Vision Transformer as the encoder and a convolutional hierarchical decoder, to learn meteorological representations. SeaMAE is pre-trained on this dataset and fine-tuned for SFD, resulting in state-of-the-art performance. For instance, using the ViT-Base as the backbone, SeaMAE pre-training which achieves <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>64.18</mn><mo>%</mo></mrow></semantics></math></inline-formula> surpasses from-scratch learning, natural imagery pre-training, and VRH satellite imagery pre-training by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>5.53</mn><mo>%</mo></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2.49</mn><mo>%</mo></mrow></semantics></math></inline-formula>, and <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2.21</mn><mo>%</mo></mrow></semantics></math></inline-formula>, respectively, in terms of Intersection over Union of SFD.
first_indexed 2024-03-10T23:36:29Z
format Article
id doaj.art-481e5a0757884e8da95dccb67db6202e
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-10T23:36:29Z
publishDate 2023-08-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-481e5a0757884e8da95dccb67db6202e2023-11-19T02:54:34ZengMDPI AGRemote Sensing2072-42922023-08-011516410210.3390/rs15164102SeaMAE: Masked Pre-Training with Meteorological Satellite Imagery for Sea Fog DetectionHaotian Yan0Sundingkai Su1Ming Wu2Mengqiu Xu3Yihao Zuo4Chuang Zhang5Bin Huang6Artificial Intelligence School, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaArtificial Intelligence School, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaArtificial Intelligence School, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaArtificial Intelligence School, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaArtificial Intelligence School, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaArtificial Intelligence School, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaNational Meteorological Center, China Meteorological Administration, Beijing 100081, ChinaSea fog detection (SFD) presents a significant challenge in the field of intelligent Earth observation, particularly in analyzing meteorological satellite imagery. Akin to various vision tasks, ImageNet pre-training is commonly used for pre-training SFD. However, in the context of multi-spectral meteorological satellite imagery, the initial step of deep learning has received limited attention. Recently, pre-training with Very High-Resolution (VHR) satellite imagery has gained increased popularity in remote-sensing vision tasks, showing the potential to replace ImageNet pre-training. However, it is worth noting that the meteorological satellite imagery applied in SFD, despite being an application of computer vision in remote sensing, differs greatly from VHR satellite imagery. To address the limitation of pre-training for SFD, this paper introduces a novel deep-learning paradigm to the meteorological domain driven by Masked Image Modeling (MIM). Our research reveals two key insights: (1) Pre-training with meteorological satellite imagery yields superior SFD performance compared to pre-training with nature imagery and VHR satellite imagery. (2) Incorporating the architectural characteristics of SFD models into a vanilla masked autoencoder (MAE) can augment the effectiveness of meteorological pre-training. To facilitate this research, we curate a pre-training dataset comprising 514,655 temporal multi-spectral meteorological satellite images, covering the Bohai Sea and Yellow Sea regions, which have the most sea fog occurrence. The longitude ranges from 115.00E to 128.75E, and the latitude ranges from 27.60N to 41.35N. Moreover, we introduce SeaMAE, a novel MAE that utilizes a Vision Transformer as the encoder and a convolutional hierarchical decoder, to learn meteorological representations. SeaMAE is pre-trained on this dataset and fine-tuned for SFD, resulting in state-of-the-art performance. For instance, using the ViT-Base as the backbone, SeaMAE pre-training which achieves <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>64.18</mn><mo>%</mo></mrow></semantics></math></inline-formula> surpasses from-scratch learning, natural imagery pre-training, and VRH satellite imagery pre-training by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>5.53</mn><mo>%</mo></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2.49</mn><mo>%</mo></mrow></semantics></math></inline-formula>, and <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2.21</mn><mo>%</mo></mrow></semantics></math></inline-formula>, respectively, in terms of Intersection over Union of SFD.https://www.mdpi.com/2072-4292/15/16/4102sea fog detectionpre-trainingmasked autoencodersmeteorological satellite imagery
spellingShingle Haotian Yan
Sundingkai Su
Ming Wu
Mengqiu Xu
Yihao Zuo
Chuang Zhang
Bin Huang
SeaMAE: Masked Pre-Training with Meteorological Satellite Imagery for Sea Fog Detection
Remote Sensing
sea fog detection
pre-training
masked autoencoders
meteorological satellite imagery
title SeaMAE: Masked Pre-Training with Meteorological Satellite Imagery for Sea Fog Detection
title_full SeaMAE: Masked Pre-Training with Meteorological Satellite Imagery for Sea Fog Detection
title_fullStr SeaMAE: Masked Pre-Training with Meteorological Satellite Imagery for Sea Fog Detection
title_full_unstemmed SeaMAE: Masked Pre-Training with Meteorological Satellite Imagery for Sea Fog Detection
title_short SeaMAE: Masked Pre-Training with Meteorological Satellite Imagery for Sea Fog Detection
title_sort seamae masked pre training with meteorological satellite imagery for sea fog detection
topic sea fog detection
pre-training
masked autoencoders
meteorological satellite imagery
url https://www.mdpi.com/2072-4292/15/16/4102
work_keys_str_mv AT haotianyan seamaemaskedpretrainingwithmeteorologicalsatelliteimageryforseafogdetection
AT sundingkaisu seamaemaskedpretrainingwithmeteorologicalsatelliteimageryforseafogdetection
AT mingwu seamaemaskedpretrainingwithmeteorologicalsatelliteimageryforseafogdetection
AT mengqiuxu seamaemaskedpretrainingwithmeteorologicalsatelliteimageryforseafogdetection
AT yihaozuo seamaemaskedpretrainingwithmeteorologicalsatelliteimageryforseafogdetection
AT chuangzhang seamaemaskedpretrainingwithmeteorologicalsatelliteimageryforseafogdetection
AT binhuang seamaemaskedpretrainingwithmeteorologicalsatelliteimageryforseafogdetection