Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot Learning

Learning from limited labelled examples is key a research hotspot with excellent scenarios and potential applications. Currently, most of metric learning-based few-shot models still have the problem of low recognition accuracy. This is mainly because that they only use the top-layer abstract feature...

Full description

Bibliographic Details
Main Authors:	Xiaoru Wang, Bing Ma, Zhihong Yu, Fu Li, Yali Cai
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Feature fusion feature weighting few-shot learning image recognition multi-scale feature
Online Access:	https://ieeexplore.ieee.org/document/9093896/

_version_	1819176177349165056
author	Xiaoru Wang Bing Ma Zhihong Yu Fu Li Yali Cai
author_facet	Xiaoru Wang Bing Ma Zhihong Yu Fu Li Yali Cai
author_sort	Xiaoru Wang
collection	DOAJ
description	Learning from limited labelled examples is key a research hotspot with excellent scenarios and potential applications. Currently, most of metric learning-based few-shot models still have the problem of low recognition accuracy. This is mainly because that they only use the top-layer abstract feature with semantic information, which ignores the low-layer features that are also critical for the few-shot recognition. Therefore, the extracted features do not have abundant representation ability, and it is difficult to recognize easily confusing objects. Moreover, they usually adopt a fixed distance function or train a comparable network to measure features. These methods lack adaptability, cannot sufficiently fuse features, which leads to weaken the fitting ability of the metric function. And the same or different classes of images are treated equally, which makes the metric function have no emphasis point during training. To address these issues, we propose an end-to-end, metric learning-based model in this paper, called multi-scale decision network with feature fusion and weighting for few-shot learning (MSDN). Considering the importance of the low-layer features, we exploit a convolutional network to extract each layer feature. Then, we exploit a relation network to learn a non-linear metric between the support set and the query set features of each layer and classify the test images via a voting decision. During feature concatenation, we design a non-linear feature fusion item to improve the way of concatenation, so that the relation network can have a stronger function fitting ability to learn the relation score. Meanwhile, we introduce the attention mechanism by calculating the cosine similarity between the support set and the query set features as their weight, which makes the relation network pay more attention to the same class of images. Our model achieves the state-of-the-art accuracy result on Omniglot and miniImageNet datasets compared with popular few-shot recognition models.
first_indexed	2024-12-22T21:06:37Z
format	Article
id	doaj.art-8114189208b0406b8bd6df3d5ec20292
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-12-22T21:06:37Z
publishDate	2020-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-8114189208b0406b8bd6df3d5ec202922022-12-21T18:12:39ZengIEEEIEEE Access2169-35362020-01-018921729218110.1109/ACCESS.2020.29948059093896Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot LearningXiaoru Wang0https://orcid.org/0000-0001-7171-2783Bing Ma1https://orcid.org/0000-0002-9856-3762Zhihong Yu2https://orcid.org/0000-0002-6379-1976Fu Li3https://orcid.org/0000-0001-8819-0547Yali Cai4https://orcid.org/0000-0002-8323-8182Beijing Key Laboratory of Network System and Network Culture, Beijing University of Posts and Telecommunications, Beijing, ChinaBeijing Key Laboratory of Network System and Network Culture, Beijing University of Posts and Telecommunications, Beijing, ChinaIntel China Research Center, Beijing, ChinaDepartment of Electrical and Computer Engineering, Portland State University, Portland, OR, USABeijing Key Laboratory of Network System and Network Culture, Beijing University of Posts and Telecommunications, Beijing, ChinaLearning from limited labelled examples is key a research hotspot with excellent scenarios and potential applications. Currently, most of metric learning-based few-shot models still have the problem of low recognition accuracy. This is mainly because that they only use the top-layer abstract feature with semantic information, which ignores the low-layer features that are also critical for the few-shot recognition. Therefore, the extracted features do not have abundant representation ability, and it is difficult to recognize easily confusing objects. Moreover, they usually adopt a fixed distance function or train a comparable network to measure features. These methods lack adaptability, cannot sufficiently fuse features, which leads to weaken the fitting ability of the metric function. And the same or different classes of images are treated equally, which makes the metric function have no emphasis point during training. To address these issues, we propose an end-to-end, metric learning-based model in this paper, called multi-scale decision network with feature fusion and weighting for few-shot learning (MSDN). Considering the importance of the low-layer features, we exploit a convolutional network to extract each layer feature. Then, we exploit a relation network to learn a non-linear metric between the support set and the query set features of each layer and classify the test images via a voting decision. During feature concatenation, we design a non-linear feature fusion item to improve the way of concatenation, so that the relation network can have a stronger function fitting ability to learn the relation score. Meanwhile, we introduce the attention mechanism by calculating the cosine similarity between the support set and the query set features as their weight, which makes the relation network pay more attention to the same class of images. Our model achieves the state-of-the-art accuracy result on Omniglot and miniImageNet datasets compared with popular few-shot recognition models.https://ieeexplore.ieee.org/document/9093896/Feature fusionfeature weightingfew-shot learningimage recognitionmulti-scale feature
spellingShingle	Xiaoru Wang Bing Ma Zhihong Yu Fu Li Yali Cai Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot Learning IEEE Access Feature fusion feature weighting few-shot learning image recognition multi-scale feature
title	Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot Learning
title_full	Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot Learning
title_fullStr	Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot Learning
title_full_unstemmed	Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot Learning
title_short	Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot Learning
title_sort	multi scale decision network with feature fusion and weighting for few shot learning
topic	Feature fusion feature weighting few-shot learning image recognition multi-scale feature
url	https://ieeexplore.ieee.org/document/9093896/
work_keys_str_mv	AT xiaoruwang multiscaledecisionnetworkwithfeaturefusionandweightingforfewshotlearning AT bingma multiscaledecisionnetworkwithfeaturefusionandweightingforfewshotlearning AT zhihongyu multiscaledecisionnetworkwithfeaturefusionandweightingforfewshotlearning AT fuli multiscaledecisionnetworkwithfeaturefusionandweightingforfewshotlearning AT yalicai multiscaledecisionnetworkwithfeaturefusionandweightingforfewshotlearning

Multi-Scale Decision Network With Feature Fusion and Weighting for Few-Shot Learning

Similar Items