Combining Masked Autoencoding and Neural Fields for Multi-band Satellite Understanding

Multi-spectral satellite remote sensing is a primary way to monitor planet-scale events such as deforestation, land-cover change, fire, and flooding. Unfortunately, incomplete spatial coverage and sparse temporal sampling make it challenging to develop a unified understanding of the environment. We...

Full description

Bibliographic Details
Main Author: Huang, Kuan Wei
Other Authors: Freeman, William T.
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/150309
_version_ 1826206888627273728
author Huang, Kuan Wei
author2 Freeman, William T.
author_facet Freeman, William T.
Huang, Kuan Wei
author_sort Huang, Kuan Wei
collection MIT
description Multi-spectral satellite remote sensing is a primary way to monitor planet-scale events such as deforestation, land-cover change, fire, and flooding. Unfortunately, incomplete spatial coverage and sparse temporal sampling make it challenging to develop a unified understanding of the environment. We aim to solve these challenges by creating a curated multi-modal satellite remote sensing dataset and presenting a novel architecture that learns a unified representation across large-scale heterogeneous remote sensing data by solving an image completion task. We equip our model with temporal, spectral, and global positioning information in addition to local positional encoding. This allows our algorithm to learn a unified, high-resolution, and time-varying representation across the entire survey area. Unlike the prior work, our architecture does not require data with uniform coverage, temporal resolution, or paired bands, and through prompting, it can act as a method for satellite infilling, temporal prediction, and cross-band translation. We train and evaluate our approach on a multi-modal remote sensing dataset and show that it outperforms baselines across satellite completion and cross-band translation tasks. In addition, we show that the neural feature field learned by our method is more effective than baselines for transfer learning to predict Amazon rainforest deforestation.
first_indexed 2024-09-23T13:39:52Z
format Thesis
id mit-1721.1/150309
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T13:39:52Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1503092023-04-01T03:27:19Z Combining Masked Autoencoding and Neural Fields for Multi-band Satellite Understanding Huang, Kuan Wei Freeman, William T. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Multi-spectral satellite remote sensing is a primary way to monitor planet-scale events such as deforestation, land-cover change, fire, and flooding. Unfortunately, incomplete spatial coverage and sparse temporal sampling make it challenging to develop a unified understanding of the environment. We aim to solve these challenges by creating a curated multi-modal satellite remote sensing dataset and presenting a novel architecture that learns a unified representation across large-scale heterogeneous remote sensing data by solving an image completion task. We equip our model with temporal, spectral, and global positioning information in addition to local positional encoding. This allows our algorithm to learn a unified, high-resolution, and time-varying representation across the entire survey area. Unlike the prior work, our architecture does not require data with uniform coverage, temporal resolution, or paired bands, and through prompting, it can act as a method for satellite infilling, temporal prediction, and cross-band translation. We train and evaluate our approach on a multi-modal remote sensing dataset and show that it outperforms baselines across satellite completion and cross-band translation tasks. In addition, we show that the neural feature field learned by our method is more effective than baselines for transfer learning to predict Amazon rainforest deforestation. M.Eng. 2023-03-31T14:46:47Z 2023-03-31T14:46:47Z 2023-02 2023-02-27T18:43:22.491Z Thesis https://hdl.handle.net/1721.1/150309 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Huang, Kuan Wei
Combining Masked Autoencoding and Neural Fields for Multi-band Satellite Understanding
title Combining Masked Autoencoding and Neural Fields for Multi-band Satellite Understanding
title_full Combining Masked Autoencoding and Neural Fields for Multi-band Satellite Understanding
title_fullStr Combining Masked Autoencoding and Neural Fields for Multi-band Satellite Understanding
title_full_unstemmed Combining Masked Autoencoding and Neural Fields for Multi-band Satellite Understanding
title_short Combining Masked Autoencoding and Neural Fields for Multi-band Satellite Understanding
title_sort combining masked autoencoding and neural fields for multi band satellite understanding
url https://hdl.handle.net/1721.1/150309
work_keys_str_mv AT huangkuanwei combiningmaskedautoencodingandneuralfieldsformultibandsatelliteunderstanding