Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
Hierarchical text classification (HTC) aims to classify a text into multiple categories organized in a hierarchical structure. The state-of-the-art HTC methods usually employ graph networks, where label graphs are constructed and label representation is learned to interact with text representations...
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Journal Article |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/175778 |
_version_ | 1811678750646992896 |
---|---|
author | Feng, Zijian Mao, Kezhi Zhou, Hanzhang |
author2 | School of Electrical and Electronic Engineering |
author_facet | School of Electrical and Electronic Engineering Feng, Zijian Mao, Kezhi Zhou, Hanzhang |
author_sort | Feng, Zijian |
collection | NTU |
description | Hierarchical text classification (HTC) aims to classify a text into multiple categories organized in a hierarchical structure. The state-of-the-art HTC methods usually employ graph networks, where label graphs are constructed and label representation is learned to interact with text representations for classification. In general, label graphs are built on the intrinsic label hierarchy, label semantic similarity, or label co-occurrence. Such graphs have been proven to be effective, but they only exploit knowledge from training data or simple label descriptions, without considering the vast external knowledge in the open sources. Actually, external knowledge from open sources could bring in complementary information to enhance the label graph's representation power. Motivated by the above considerations, we explore the use of external knowledge for improving HTC in this paper. We categorize knowledge into micro-knowledge and macro-knowledge, which are defined as the fundamental concepts related to a single class label and the correlations among class labels, respectively. For tailor-made incorporation of the two types of knowledge into representation learning and classification, we propose Adaptive Micro- and Macro-Knowledge Incorporation for Hierarchical Text Classification (AMKI-HTC) model in this paper. The micro-knowledge incorporation helps capture class-relevant keywords in the text and hence produce discriminative representations, while the macro-knowledge incorporation improves the accuracy of label graphs. Finally, a confidence maximization fusion strategy is developed for adaptive aggregation of multi-view features. Extensive experiments on three benchmark HTC datasets demonstrate that AMKI-HTC consistently outperforms state-of-the-art models. |
first_indexed | 2024-10-01T02:58:14Z |
format | Journal Article |
id | ntu-10356/175778 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2024-10-01T02:58:14Z |
publishDate | 2024 |
record_format | dspace |
spelling | ntu-10356/1757782024-05-06T08:07:44Z Adaptive micro- and macro-knowledge incorporation for hierarchical text classification Feng, Zijian Mao, Kezhi Zhou, Hanzhang School of Electrical and Electronic Engineering Interdisciplinary Graduate School (IGS) Future Resilient Systems Programme, Singapore-ETH Centre Engineering Hierarchical text classification Knowledge incorporation Hierarchical text classification (HTC) aims to classify a text into multiple categories organized in a hierarchical structure. The state-of-the-art HTC methods usually employ graph networks, where label graphs are constructed and label representation is learned to interact with text representations for classification. In general, label graphs are built on the intrinsic label hierarchy, label semantic similarity, or label co-occurrence. Such graphs have been proven to be effective, but they only exploit knowledge from training data or simple label descriptions, without considering the vast external knowledge in the open sources. Actually, external knowledge from open sources could bring in complementary information to enhance the label graph's representation power. Motivated by the above considerations, we explore the use of external knowledge for improving HTC in this paper. We categorize knowledge into micro-knowledge and macro-knowledge, which are defined as the fundamental concepts related to a single class label and the correlations among class labels, respectively. For tailor-made incorporation of the two types of knowledge into representation learning and classification, we propose Adaptive Micro- and Macro-Knowledge Incorporation for Hierarchical Text Classification (AMKI-HTC) model in this paper. The micro-knowledge incorporation helps capture class-relevant keywords in the text and hence produce discriminative representations, while the macro-knowledge incorporation improves the accuracy of label graphs. Finally, a confidence maximization fusion strategy is developed for adaptive aggregation of multi-view features. Extensive experiments on three benchmark HTC datasets demonstrate that AMKI-HTC consistently outperforms state-of-the-art models. National Research Foundation (NRF) We would like to acknowledge that this project is a product of the Future Resilient Systems initiative at the Singapore-ETH Centre (SEC), and it has been made possible thanks to the support of the National Research Foundation, Prime Minister’s Office, Singapore, through its Campus for Research Excellence and Technological Enterprise (CREATE) programme. 2024-05-06T08:07:44Z 2024-05-06T08:07:44Z 2024 Journal Article Feng, Z., Mao, K. & Zhou, H. (2024). Adaptive micro- and macro-knowledge incorporation for hierarchical text classification. Expert Systems With Applications, 248, 123374-. https://dx.doi.org/10.1016/j.eswa.2024.123374 0957-4174 https://hdl.handle.net/10356/175778 10.1016/j.eswa.2024.123374 2-s2.0-85185280823 248 123374 en CREATE Expert Systems with Applications © 2024 Elsevier Ltd. All rights reserved. |
spellingShingle | Engineering Hierarchical text classification Knowledge incorporation Feng, Zijian Mao, Kezhi Zhou, Hanzhang Adaptive micro- and macro-knowledge incorporation for hierarchical text classification |
title | Adaptive micro- and macro-knowledge incorporation for hierarchical text classification |
title_full | Adaptive micro- and macro-knowledge incorporation for hierarchical text classification |
title_fullStr | Adaptive micro- and macro-knowledge incorporation for hierarchical text classification |
title_full_unstemmed | Adaptive micro- and macro-knowledge incorporation for hierarchical text classification |
title_short | Adaptive micro- and macro-knowledge incorporation for hierarchical text classification |
title_sort | adaptive micro and macro knowledge incorporation for hierarchical text classification |
topic | Engineering Hierarchical text classification Knowledge incorporation |
url | https://hdl.handle.net/10356/175778 |
work_keys_str_mv | AT fengzijian adaptivemicroandmacroknowledgeincorporationforhierarchicaltextclassification AT maokezhi adaptivemicroandmacroknowledgeincorporationforhierarchicaltextclassification AT zhouhanzhang adaptivemicroandmacroknowledgeincorporationforhierarchicaltextclassification |