LCReg: long-tailed image classification with latent categories based recognition

In this work, we tackle the challenging problem of long-tailed image recognition. Previous long-tailed recognition approaches mainly focus on data augmentation or re-balancing strategies for the tail classes to give them more attention during model training. However, these methods are limited by the...

Full description

Bibliographic Details
Main Authors: Liu, Weide, Wu, Zhonghua, Wang, Yiming, Ding, Henghui, Liu, Fayao, Lin, Jie, Lin, Guosheng
Other Authors: School of Computer Science and Engineering
Format: Journal Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/170932
_version_ 1811685396716716032
author Liu, Weide
Wu, Zhonghua
Wang, Yiming
Ding, Henghui
Liu, Fayao
Lin, Jie
Lin, Guosheng
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Liu, Weide
Wu, Zhonghua
Wang, Yiming
Ding, Henghui
Liu, Fayao
Lin, Jie
Lin, Guosheng
author_sort Liu, Weide
collection NTU
description In this work, we tackle the challenging problem of long-tailed image recognition. Previous long-tailed recognition approaches mainly focus on data augmentation or re-balancing strategies for the tail classes to give them more attention during model training. However, these methods are limited by the small number of training images for the tail classes, which results in poor feature representations. To address this issue, we propose the Latent Categories based long-tail Recognition (LCReg) method. Our hypothesis is that common latent features shared by head and tail classes can be used to improve feature representation. Specifically, we learn a set of class-agnostic latent features shared by both head and tail classes, and then use semantic data augmentation on the latent features to implicitly increase the diversity of the training sample. We conduct extensive experiments on five long-tailed image recognition datasets, and the results show that our proposed method significantly improves the baselines.
first_indexed 2024-10-01T04:43:52Z
format Journal Article
id ntu-10356/170932
institution Nanyang Technological University
language English
last_indexed 2024-10-01T04:43:52Z
publishDate 2023
record_format dspace
spelling ntu-10356/1709322023-10-13T15:36:35Z LCReg: long-tailed image classification with latent categories based recognition Liu, Weide Wu, Zhonghua Wang, Yiming Ding, Henghui Liu, Fayao Lin, Jie Lin, Guosheng School of Computer Science and Engineering Engineering::Computer science and engineering Long-Tailed Image Classification In this work, we tackle the challenging problem of long-tailed image recognition. Previous long-tailed recognition approaches mainly focus on data augmentation or re-balancing strategies for the tail classes to give them more attention during model training. However, these methods are limited by the small number of training images for the tail classes, which results in poor feature representations. To address this issue, we propose the Latent Categories based long-tail Recognition (LCReg) method. Our hypothesis is that common latent features shared by head and tail classes can be used to improve feature representation. Specifically, we learn a set of class-agnostic latent features shared by both head and tail classes, and then use semantic data augmentation on the latent features to implicitly increase the diversity of the training sample. We conduct extensive experiments on five long-tailed image recognition datasets, and the results show that our proposed method significantly improves the baselines. Agency for Science, Technology and Research (A*STAR) Ministry of Education (MOE) Submitted/Accepted version This research is supported by the Ministry of Education, Singapore, under its Academic Research Fund Tier 1 (RG95/20). This research is also supported by the Agency for Science, Technology and Research (A*STAR) under its AME Programmatic Funds (Grant No. A20H6b0151). 2023-10-09T03:01:50Z 2023-10-09T03:01:50Z 2024 Journal Article Liu, W., Wu, Z., Wang, Y., Ding, H., Liu, F., Lin, J. & Lin, G. (2024). LCReg: long-tailed image classification with latent categories based recognition. Pattern Recognition, 145, 109971-. https://dx.doi.org/10.1016/j.patcog.2023.109971 0031-3203 https://hdl.handle.net/10356/170932 10.1016/j.patcog.2023.109971 2-s2.0-85171971903 145 109971 en RG95/20 A20H6b0151 Pattern Recognition © 2023 Elsevier Ltd. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1016/j.patcog.2023.10. application/pdf
spellingShingle Engineering::Computer science and engineering
Long-Tailed
Image Classification
Liu, Weide
Wu, Zhonghua
Wang, Yiming
Ding, Henghui
Liu, Fayao
Lin, Jie
Lin, Guosheng
LCReg: long-tailed image classification with latent categories based recognition
title LCReg: long-tailed image classification with latent categories based recognition
title_full LCReg: long-tailed image classification with latent categories based recognition
title_fullStr LCReg: long-tailed image classification with latent categories based recognition
title_full_unstemmed LCReg: long-tailed image classification with latent categories based recognition
title_short LCReg: long-tailed image classification with latent categories based recognition
title_sort lcreg long tailed image classification with latent categories based recognition
topic Engineering::Computer science and engineering
Long-Tailed
Image Classification
url https://hdl.handle.net/10356/170932
work_keys_str_mv AT liuweide lcreglongtailedimageclassificationwithlatentcategoriesbasedrecognition
AT wuzhonghua lcreglongtailedimageclassificationwithlatentcategoriesbasedrecognition
AT wangyiming lcreglongtailedimageclassificationwithlatentcategoriesbasedrecognition
AT dinghenghui lcreglongtailedimageclassificationwithlatentcategoriesbasedrecognition
AT liufayao lcreglongtailedimageclassificationwithlatentcategoriesbasedrecognition
AT linjie lcreglongtailedimageclassificationwithlatentcategoriesbasedrecognition
AT linguosheng lcreglongtailedimageclassificationwithlatentcategoriesbasedrecognition