Localization-Aware Adaptive Pairwise Margin Loss for Fine-Grained Image Recognition

Fine-grained image recognition is a highly challenging problem due to subtle differences between images. There are many attempts to solve fine-grained image recognition problems using data augmentation, jointly optimizing deep metric learning. CutMix is one of the excellent data augmentation strateg...

Full description

Bibliographic Details
Main Authors: Taehung Kim, Hoseong Kim, Hyeran Byun
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9313990/
_version_ 1828406430384783360
author Taehung Kim
Hoseong Kim
Hyeran Byun
author_facet Taehung Kim
Hoseong Kim
Hyeran Byun
author_sort Taehung Kim
collection DOAJ
description Fine-grained image recognition is a highly challenging problem due to subtle differences between images. There are many attempts to solve fine-grained image recognition problems using data augmentation, jointly optimizing deep metric learning. CutMix is one of the excellent data augmentation strategies which crops and merges to generate new images. However, it sometimes generates meaningless and obscured object images that degrade recognition performance. We propose a novel framework that solves the above problem and expands the CutMix leveraging localizing method. Also, we improve the recognition accuracy to joint optimizing with a pairwise margin loss using generated images from the improved CutMix. There are some images similar to the reference image among the generated images. They are generated by replacing similar parts from the reference image. Those generated images should not be located much farther than the margin value in embedding space because those generated images and a reference image have similar semantic meaning. However, the conventional margin loss can not consider those images which are located much farther than the margin. To solve this problem, we propose an additional margin loss to consider those generated images. The proposed framework consists of two stages: the part localization-aware CutMix and an adaptive pairwise margin loss. The proposed method achieves state-of-the-art performance on the CUB-200-2011, FGVC-Aircraft, Stanford Cars, and DeepFashion datasets. Furthermore, extensive experiments demonstrate that each stage improves the final performance.
first_indexed 2024-12-10T11:11:36Z
format Article
id doaj.art-b0214248190c49e8b6d81550df48126f
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-10T11:11:36Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-b0214248190c49e8b6d81550df48126f2022-12-22T01:51:24ZengIEEEIEEE Access2169-35362021-01-0198786879610.1109/ACCESS.2021.30493059313990Localization-Aware Adaptive Pairwise Margin Loss for Fine-Grained Image RecognitionTaehung Kim0https://orcid.org/0000-0002-2233-7881Hoseong Kim1https://orcid.org/0000-0002-1583-1558Hyeran Byun2https://orcid.org/0000-0002-3082-3214Department of Computer Science, Yonsei University, Seoul, South KoreaAgency for Defense Development, Daejeon, South KoreaDepartment of Computer Science, Yonsei University, Seoul, South KoreaFine-grained image recognition is a highly challenging problem due to subtle differences between images. There are many attempts to solve fine-grained image recognition problems using data augmentation, jointly optimizing deep metric learning. CutMix is one of the excellent data augmentation strategies which crops and merges to generate new images. However, it sometimes generates meaningless and obscured object images that degrade recognition performance. We propose a novel framework that solves the above problem and expands the CutMix leveraging localizing method. Also, we improve the recognition accuracy to joint optimizing with a pairwise margin loss using generated images from the improved CutMix. There are some images similar to the reference image among the generated images. They are generated by replacing similar parts from the reference image. Those generated images should not be located much farther than the margin value in embedding space because those generated images and a reference image have similar semantic meaning. However, the conventional margin loss can not consider those images which are located much farther than the margin. To solve this problem, we propose an additional margin loss to consider those generated images. The proposed framework consists of two stages: the part localization-aware CutMix and an adaptive pairwise margin loss. The proposed method achieves state-of-the-art performance on the CUB-200-2011, FGVC-Aircraft, Stanford Cars, and DeepFashion datasets. Furthermore, extensive experiments demonstrate that each stage improves the final performance.https://ieeexplore.ieee.org/document/9313990/Adaptive margindeep neural networksfine-grained image recognitionmetric learningimage augmentationimage generation
spellingShingle Taehung Kim
Hoseong Kim
Hyeran Byun
Localization-Aware Adaptive Pairwise Margin Loss for Fine-Grained Image Recognition
IEEE Access
Adaptive margin
deep neural networks
fine-grained image recognition
metric learning
image augmentation
image generation
title Localization-Aware Adaptive Pairwise Margin Loss for Fine-Grained Image Recognition
title_full Localization-Aware Adaptive Pairwise Margin Loss for Fine-Grained Image Recognition
title_fullStr Localization-Aware Adaptive Pairwise Margin Loss for Fine-Grained Image Recognition
title_full_unstemmed Localization-Aware Adaptive Pairwise Margin Loss for Fine-Grained Image Recognition
title_short Localization-Aware Adaptive Pairwise Margin Loss for Fine-Grained Image Recognition
title_sort localization aware adaptive pairwise margin loss for fine grained image recognition
topic Adaptive margin
deep neural networks
fine-grained image recognition
metric learning
image augmentation
image generation
url https://ieeexplore.ieee.org/document/9313990/
work_keys_str_mv AT taehungkim localizationawareadaptivepairwisemarginlossforfinegrainedimagerecognition
AT hoseongkim localizationawareadaptivepairwisemarginlossforfinegrainedimagerecognition
AT hyeranbyun localizationawareadaptivepairwisemarginlossforfinegrainedimagerecognition