A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion
Abstract Entity synonyms play a significant role in entity-based tasks. Previous approaches use linguistic syntax, distributional, and semantic features to expand entity synonym sets from text corpora. Due to the flexibility and complexity of the Chinese language expression, the aforementioned appro...
Main Authors: | Subin Huang, Yu Xiu, Jun Li, Sanmin Liu, Chao Kong |
---|---|
Format: | Article |
Language: | English |
Published: | Springer, 2023-04-01 |
Series: | Complex & Intelligent Systems |
Subjects: | Synonym set expansion; Siamese network; Bilateral context; Filtering strategy |
Online Access: | https://doi.org/10.1007/s40747-023-01064-w |
_version_ | 1827808158118051840 |
---|---|
author | Subin Huang; Yu Xiu; Jun Li; Sanmin Liu; Chao Kong |
author_facet | Subin Huang; Yu Xiu; Jun Li; Sanmin Liu; Chao Kong |
author_sort | Subin Huang |
collection | DOAJ |
description | Abstract: Entity synonyms play a significant role in entity-based tasks. Previous approaches use linguistic syntax, distributional, and semantic features to expand entity synonym sets from text corpora. Because Chinese expression is flexible and complex, these approaches still struggle to expand entity synonym sets robustly from Chinese text: they fail to track holistic semantics among entities and suffer from error propagation. This paper introduces an approach for expanding Chinese entity synonym sets based on bilateral context and a filtering strategy. Specifically, the approach consists of two novel components. First, a bilateral-context-based Siamese network classifier is proposed to determine whether a new entity should be inserted into an existing entity synonym set. The classifier tracks the holistic semantics of bilateral contexts and can impose soft holistic semantic constraints to improve synonym prediction. Second, a filtering-strategy-based set expansion algorithm is presented to generate Chinese entity synonym sets. The filtering strategy enforces semantic and domain consistency to filter out wrong synonym entities, thereby mitigating error propagation. Experimental results on two real-world Chinese datasets demonstrate that the proposed approach is effective and outperforms the selected state-of-the-art approaches on the Chinese entity synonym set expansion task. |
first_indexed | 2024-03-11T22:08:18Z |
format | Article |
id | doaj.art-ce4810a5d84f4cdd9d2d4610db8a1707 |
institution | Directory of Open Access Journals |
issn | 2199-4536 2198-6053 |
language | English |
last_indexed | 2024-03-11T22:08:18Z |
publishDate | 2023-04-01 |
publisher | Springer |
record_format | Article |
series | Complex & Intelligent Systems |
spelling | doaj.art-ce4810a5d84f4cdd9d2d4610db8a1707; 2023-09-24T11:34:58Z; eng; Springer; Complex & Intelligent Systems; 2199-4536; 2198-6053; 2023-04-01; 9; 5; 6065; 6085; 10.1007/s40747-023-01064-w; A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion; Subin Huang (0), Yu Xiu (1), Jun Li (2), Sanmin Liu (3), Chao Kong (4); (0-4) School of Computer and Information, Anhui Polytechnic University; Abstract Entity synonyms play a significant role in entity-based tasks. Previous approaches use linguistic syntax, distributional, and semantic features to expand entity synonym sets from text corpora. Due to the flexibility and complexity of the Chinese language expression, the aforementioned approaches are still difficult to expand entity synonym sets robustly from Chinese text, because these approaches fail to track holistic semantics among entities and suffer from error propagation. This paper introduces an approach for expanding Chinese entity synonym sets based on bilateral context and filtering strategy. Specifically, the approach consists of two novel components. First, a bilateral-context-based Siamese network classifier is proposed to determine whether a new entity should be inserted into the existing entity synonym set. The classifier tracks the holistic semantics of bilateral contexts and is capable of imposing soft holistic semantic constraints to improve synonym prediction. Second, a filtering-strategy-based set expansion algorithm is presented to generate Chinese entity synonym sets. The filtering strategy enhances semantic and domain consistencies to filter out wrong synonym entities, thereby mitigating error propagation. Experimental results on two Chinese real-world datasets demonstrate that the proposed approach is effective and outperforms the selected existing state-of-the-art approaches to the Chinese entity synonym set expansion task.; https://doi.org/10.1007/s40747-023-01064-w; Synonym set expansion, Siamese network, Bilateral context, Filtering strategy |
spellingShingle | Subin Huang Yu Xiu Jun Li Sanmin Liu Chao Kong A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion Complex & Intelligent Systems Synonym set expansion Siamese network Bilateral context Filtering strategy |
title | A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion |
title_full | A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion |
title_fullStr | A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion |
title_full_unstemmed | A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion |
title_short | A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion |
title_sort | bilateral context and filtering strategy based approach to chinese entity synonym set expansion |
topic | Synonym set expansion; Siamese network; Bilateral context; Filtering strategy |
url | https://doi.org/10.1007/s40747-023-01064-w |
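
Illustrative note on the abstract: the two components it describes (a set-level classifier that decides whether a new entity joins an existing synonym set, and a filtering step that rejects inconsistent candidates before they can propagate errors) can be pictured with the minimal Python sketch below. This is not the authors' implementation. The function names, the character-overlap score standing in for the Siamese network classifier, the `TOY_DOMAIN` lookup standing in for the domain-consistency check, and the threshold value are all assumptions made for illustration.

```python
from typing import Callable, Iterable, Set

# Hypothetical domain labels; a stand-in for the paper's domain-consistency signal.
TOY_DOMAIN = {
    "北京大学": "university",
    "北大": "university",
    "北京": "city",
    "清华大学": "university",
}


def toy_set_score(members: Set[str], cand: str) -> float:
    """Stand-in for the set-level classifier: mean character Jaccard overlap.

    Assumes `members` is non-empty and all strings are non-empty.
    """
    def jaccard(a: str, b: str) -> float:
        sa, sb = set(a), set(b)
        return len(sa & sb) / len(sa | sb)

    return sum(jaccard(m, cand) for m in members) / len(members)


def toy_domain_filter(members: Set[str], cand: str) -> bool:
    """Stand-in filter: keep a candidate only if its domain matches every member."""
    return all(TOY_DOMAIN.get(m) == TOY_DOMAIN.get(cand) for m in members)


def expand_synonym_set(
    seed: Set[str],
    candidates: Iterable[str],
    set_score: Callable[[Set[str], str], float],
    passes_filter: Callable[[Set[str], str], bool],
    threshold: float = 0.5,  # assumed cut-off, not taken from the paper
) -> Set[str]:
    """Greedily grow a synonym set, scoring each candidate against the whole set."""
    expanded = set(seed)
    for cand in candidates:
        if cand in expanded:
            continue
        # Step 1: set-level decision (would be the Siamese classifier).
        if set_score(expanded, cand) < threshold:
            continue
        # Step 2: filtering, so a wrong entity never enters the set
        # and cannot distort later decisions.
        if not passes_filter(expanded, cand):
            continue
        expanded.add(cand)
    return expanded


if __name__ == "__main__":
    result = expand_synonym_set(
        {"北京大学"}, ["北大", "北京", "清华大学"],
        toy_set_score, toy_domain_filter, threshold=0.4,
    )
    print(sorted(result))
```

In this toy run, "北京" clears the overlap score but is rejected by the domain filter, while "清华大学" is rejected by the score alone; only "北大" joins the seed set. Because every candidate is scored against the current set rather than a single entity, a wrongly admitted member would affect later decisions, which is the error-propagation risk the filtering step is meant to limit.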