Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures

Domain-generalized few-shot text classification (DG-FSTC) is a new setting for few-shot text classification (FSTC). In DG-FSTC, the model is meta-trained on a multi-domain dataset, and meta-tested on unseen datasets with different domains. However, previous methods mostly construct semantic represen...

Full description

Bibliographic Details
Main Authors: Xuyang Wang, Yajun Du, Danroujing Chen, Xianyong Li, Xiaoliang Chen, Yongquan Fan, Chunzhi Xie, Yanli Li, Jia Liu
Format: Article
Language:English
Published: MDPI AG 2023-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/2/1202
_version_ 1797446454755721216
author Xuyang Wang
Yajun Du
Danroujing Chen
Xianyong Li
Xiaoliang Chen
Yongquan Fan
Chunzhi Xie
Yanli Li
Jia Liu
author_facet Xuyang Wang
Yajun Du
Danroujing Chen
Xianyong Li
Xiaoliang Chen
Yongquan Fan
Chunzhi Xie
Yanli Li
Jia Liu
author_sort Xuyang Wang
collection DOAJ
description Domain-generalized few-shot text classification (DG-FSTC) is a new setting for few-shot text classification (FSTC). In DG-FSTC, the model is meta-trained on a multi-domain dataset, and meta-tested on unseen datasets with different domains. However, previous methods mostly construct semantic representations by learning from words directly, which is limited in domain adaptability. In this study, we enhance the domain adaptability of the model by utilizing the distributional signatures of texts that indicate domain-related features in specific domains. We propose a <b>Multi</b>-level <b>D</b>istributional <b>S</b>ignatures based model, namely MultiDS. Firstly, inspired by pretrained language models, we compute distributional signatures from an extra large news corpus, and we denote these as domain-agnostic features. Then we calculate the distributional signatures from texts in the same domain and texts from the same class, respectively. These two kinds of information are regarded as domain-specific and class-specific features, respectively. After that, we fuse and translate these three distributional signatures into word-level attention values, which enables the model to capture informative features as domain changes. In addition, we utilize domain-specific distributional signatures for the calibration of feature representations in specific domains. The calibration vectors produced by the domain-specific distributional signatures and word embeddings help the model adapt to various domains. Extensive experiments are performed on four benchmarks. The results demonstrate that our proposed method beats the state-of-the-art method with an average improvement of 1.41% on four datasets. Compared with five competitive baselines, our method achieves the best average performance. The ablation studies prove the effectiveness of each proposed module.
first_indexed 2024-03-09T13:40:45Z
format Article
id doaj.art-ccfa7f84a4794873a2f85726f856c434
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-09T13:40:45Z
publishDate 2023-01-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-ccfa7f84a4794873a2f85726f856c4342023-11-30T21:07:38ZengMDPI AGApplied Sciences2076-34172023-01-01132120210.3390/app13021202Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional SignaturesXuyang Wang0Yajun Du1Danroujing Chen2Xianyong Li3Xiaoliang Chen4Yongquan Fan5Chunzhi Xie6Yanli Li7Jia Liu8School of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaSchool of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaSchool of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaSchool of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaSchool of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaSchool of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaSchool of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaSchool of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaSchool of Computer and Software Engineering, Xihua University, Chengdu 610039, ChinaDomain-generalized few-shot text classification (DG-FSTC) is a new setting for few-shot text classification (FSTC). In DG-FSTC, the model is meta-trained on a multi-domain dataset, and meta-tested on unseen datasets with different domains. However, previous methods mostly construct semantic representations by learning from words directly, which is limited in domain adaptability. In this study, we enhance the domain adaptability of the model by utilizing the distributional signatures of texts that indicate domain-related features in specific domains. We propose a <b>Multi</b>-level <b>D</b>istributional <b>S</b>ignatures based model, namely MultiDS. Firstly, inspired by pretrained language models, we compute distributional signatures from an extra large news corpus, and we denote these as domain-agnostic features. Then we calculate the distributional signatures from texts in the same domain and texts from the same class, respectively. These two kinds of information are regarded as domain-specific and class-specific features, respectively. After that, we fuse and translate these three distributional signatures into word-level attention values, which enables the model to capture informative features as domain changes. In addition, we utilize domain-specific distributional signatures for the calibration of feature representations in specific domains. The calibration vectors produced by the domain-specific distributional signatures and word embeddings help the model adapt to various domains. Extensive experiments are performed on four benchmarks. The results demonstrate that our proposed method beats the state-of-the-art method with an average improvement of 1.41% on four datasets. Compared with five competitive baselines, our method achieves the best average performance. The ablation studies prove the effectiveness of each proposed module.https://www.mdpi.com/2076-3417/13/2/1202domain-generalized few-shot learningtext classificationdistributional signaturemeta-learning
spellingShingle Xuyang Wang
Yajun Du
Danroujing Chen
Xianyong Li
Xiaoliang Chen
Yongquan Fan
Chunzhi Xie
Yanli Li
Jia Liu
Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures
Applied Sciences
domain-generalized few-shot learning
text classification
distributional signature
meta-learning
title Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures
title_full Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures
title_fullStr Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures
title_full_unstemmed Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures
title_short Improving Domain-Generalized Few-Shot Text Classification with Multi-Level Distributional Signatures
title_sort improving domain generalized few shot text classification with multi level distributional signatures
topic domain-generalized few-shot learning
text classification
distributional signature
meta-learning
url https://www.mdpi.com/2076-3417/13/2/1202
work_keys_str_mv AT xuyangwang improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures
AT yajundu improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures
AT danroujingchen improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures
AT xianyongli improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures
AT xiaoliangchen improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures
AT yongquanfan improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures
AT chunzhixie improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures
AT yanlili improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures
AT jialiu improvingdomaingeneralizedfewshottextclassificationwithmultileveldistributionalsignatures