Improving Deep Learning for Maritime Remote Sensing through Data Augmentation and Latent Space

Training deep learning models requires having the right data for the problem and understanding both your data and the models’ performance on that data. Training deep learning models is difficult when data are limited, so in this paper, we seek to answer the following question: how can we train a dee...

Full description

Bibliographic Details
Main Authors: Daniel Sobien, Erik Higgins, Justin Krometis, Justin Kauffman, Laura Freeman
Format: Article
Language:English
Published: MDPI AG 2022-07-01
Series:Machine Learning and Knowledge Extraction
Subjects:
Online Access:https://www.mdpi.com/2504-4990/4/3/31
_version_ 1827659394093940736
author Daniel Sobien
Erik Higgins
Justin Krometis
Justin Kauffman
Laura Freeman
author_facet Daniel Sobien
Erik Higgins
Justin Krometis
Justin Kauffman
Laura Freeman
author_sort Daniel Sobien
collection DOAJ
description Training deep learning models requires having the right data for the problem and understanding both your data and the models’ performance on that data. Training deep learning models is difficult when data are limited, so in this paper, we seek to answer the following question: how can we train a deep learning model to increase its performance on a targeted area with limited data? We do this by applying rotation data augmentations to a simulated synthetic aperture radar (SAR) image dataset. We use the Uniform Manifold Approximation and Projection (UMAP) dimensionality reduction technique to understand the effects of augmentations on the data in latent space. Using this latent space representation, we can understand the data and choose specific training samples aimed at boosting model performance in targeted under-performing regions without the need to increase training set sizes. Results show that using latent space to choose training data significantly improves model performance in some cases; however, there are other cases where no improvements are made. We show that linking patterns in latent space is a possible predictor of model performance, but results require some experimentation and domain knowledge to determine the best options.
first_indexed 2024-03-09T23:20:51Z
format Article
id doaj.art-bec0ec685d90495f8edad4634061fb3e
institution Directory Open Access Journal
issn 2504-4990
language English
last_indexed 2024-03-09T23:20:51Z
publishDate 2022-07-01
publisher MDPI AG
record_format Article
series Machine Learning and Knowledge Extraction
spelling doaj.art-bec0ec685d90495f8edad4634061fb3e2023-11-23T17:28:09ZengMDPI AGMachine Learning and Knowledge Extraction2504-49902022-07-014366568710.3390/make4030031Improving Deep Learning for Maritime Remote Sensing through Data Augmentation and Latent SpaceDaniel Sobien0Erik Higgins1Justin Krometis2Justin Kauffman3Laura Freeman4National Security Institute, Virginia Tech, Arlington, VA 22203, USADepartment of Aerospace and Ocean Engineering, Virginia Tech, Blacksburg, VA 24061, USANational Security Institute, Virginia Tech, Arlington, VA 22203, USANational Security Institute, Virginia Tech, Arlington, VA 22203, USANational Security Institute, Virginia Tech, Arlington, VA 22203, USATraining deep learning models requires having the right data for the problem and understanding both your data and the models’ performance on that data. Training deep learning models is difficult when data are limited, so in this paper, we seek to answer the following question: how can we train a deep learning model to increase its performance on a targeted area with limited data? We do this by applying rotation data augmentations to a simulated synthetic aperture radar (SAR) image dataset. We use the Uniform Manifold Approximation and Projection (UMAP) dimensionality reduction technique to understand the effects of augmentations on the data in latent space. Using this latent space representation, we can understand the data and choose specific training samples aimed at boosting model performance in targeted under-performing regions without the need to increase training set sizes. Results show that using latent space to choose training data significantly improves model performance in some cases; however, there are other cases where no improvements are made. We show that linking patterns in latent space is a possible predictor of model performance, but results require some experimentation and domain knowledge to determine the best options.https://www.mdpi.com/2504-4990/4/3/31data augmentationdimensionality reductionlatent spaceUMAPsimulated datadeep neural network
spellingShingle Daniel Sobien
Erik Higgins
Justin Krometis
Justin Kauffman
Laura Freeman
Improving Deep Learning for Maritime Remote Sensing through Data Augmentation and Latent Space
Machine Learning and Knowledge Extraction
data augmentation
dimensionality reduction
latent space
UMAP
simulated data
deep neural network
title Improving Deep Learning for Maritime Remote Sensing through Data Augmentation and Latent Space
title_full Improving Deep Learning for Maritime Remote Sensing through Data Augmentation and Latent Space
title_fullStr Improving Deep Learning for Maritime Remote Sensing through Data Augmentation and Latent Space
title_full_unstemmed Improving Deep Learning for Maritime Remote Sensing through Data Augmentation and Latent Space
title_short Improving Deep Learning for Maritime Remote Sensing through Data Augmentation and Latent Space
title_sort improving deep learning for maritime remote sensing through data augmentation and latent space
topic data augmentation
dimensionality reduction
latent space
UMAP
simulated data
deep neural network
url https://www.mdpi.com/2504-4990/4/3/31
work_keys_str_mv AT danielsobien improvingdeeplearningformaritimeremotesensingthroughdataaugmentationandlatentspace
AT erikhiggins improvingdeeplearningformaritimeremotesensingthroughdataaugmentationandlatentspace
AT justinkrometis improvingdeeplearningformaritimeremotesensingthroughdataaugmentationandlatentspace
AT justinkauffman improvingdeeplearningformaritimeremotesensingthroughdataaugmentationandlatentspace
AT laurafreeman improvingdeeplearningformaritimeremotesensingthroughdataaugmentationandlatentspace