A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion

Abstract Entity synonyms play a significant role in entity-based tasks. Previous approaches use linguistic syntax, distributional, and semantic features to expand entity synonym sets from text corpora. Due to the flexibility and complexity of the Chinese language expression, the aforementioned appro...

Full description

Bibliographic Details
Main Authors: Subin Huang, Yu Xiu, Jun Li, Sanmin Liu, Chao Kong
Format: Article
Language:English
Published: Springer 2023-04-01
Series:Complex & Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1007/s40747-023-01064-w
_version_ 1797675065243860992
author Subin Huang
Yu Xiu
Jun Li
Sanmin Liu
Chao Kong
author_facet Subin Huang
Yu Xiu
Jun Li
Sanmin Liu
Chao Kong
author_sort Subin Huang
collection DOAJ
description Abstract Entity synonyms play a significant role in entity-based tasks. Previous approaches use linguistic syntax, distributional, and semantic features to expand entity synonym sets from text corpora. Due to the flexibility and complexity of the Chinese language expression, the aforementioned approaches are still difficult to expand entity synonym sets robustly from Chinese text, because these approaches fail to track holistic semantics among entities and suffer from error propagation. This paper introduces an approach for expanding Chinese entity synonym sets based on bilateral context and filtering strategy. Specifically, the approach consists of two novel components. First, a bilateral-context-based Siamese network classifier is proposed to determine whether a new entity should be inserted into the existing entity synonym set. The classifier tracks the holistic semantics of bilateral contexts and is capable of imposing soft holistic semantic constraints to improve synonym prediction. Second, a filtering-strategy-based set expansion algorithm is presented to generate Chinese entity synonym sets. The filtering strategy enhances semantic and domain consistencies to filter out wrong synonym entities, thereby mitigating error propagation. Experimental results on two Chinese real-world datasets demonstrate that the proposed approach is effective and outperforms the selected existing state-of-the-art approaches to the Chinese entity synonym set expansion task.
first_indexed 2024-03-11T22:08:18Z
format Article
id doaj.art-ce4810a5d84f4cdd9d2d4610db8a1707
institution Directory Open Access Journal
issn 2199-4536
2198-6053
language English
last_indexed 2024-03-11T22:08:18Z
publishDate 2023-04-01
publisher Springer
record_format Article
series Complex & Intelligent Systems
spelling doaj.art-ce4810a5d84f4cdd9d2d4610db8a17072023-09-24T11:34:58ZengSpringerComplex & Intelligent Systems2199-45362198-60532023-04-01956065608510.1007/s40747-023-01064-wA bilateral context and filtering strategy-based approach to Chinese entity synonym set expansionSubin Huang0Yu Xiu1Jun Li2Sanmin Liu3Chao Kong4School of Computer and Information, Anhui Polytechnic UniversitySchool of Computer and Information, Anhui Polytechnic UniversitySchool of Computer and Information, Anhui Polytechnic UniversitySchool of Computer and Information, Anhui Polytechnic UniversitySchool of Computer and Information, Anhui Polytechnic UniversityAbstract Entity synonyms play a significant role in entity-based tasks. Previous approaches use linguistic syntax, distributional, and semantic features to expand entity synonym sets from text corpora. Due to the flexibility and complexity of the Chinese language expression, the aforementioned approaches are still difficult to expand entity synonym sets robustly from Chinese text, because these approaches fail to track holistic semantics among entities and suffer from error propagation. This paper introduces an approach for expanding Chinese entity synonym sets based on bilateral context and filtering strategy. Specifically, the approach consists of two novel components. First, a bilateral-context-based Siamese network classifier is proposed to determine whether a new entity should be inserted into the existing entity synonym set. The classifier tracks the holistic semantics of bilateral contexts and is capable of imposing soft holistic semantic constraints to improve synonym prediction. Second, a filtering-strategy-based set expansion algorithm is presented to generate Chinese entity synonym sets. The filtering strategy enhances semantic and domain consistencies to filter out wrong synonym entities, thereby mitigating error propagation. Experimental results on two Chinese real-world datasets demonstrate that the proposed approach is effective and outperforms the selected existing state-of-the-art approaches to the Chinese entity synonym set expansion task.https://doi.org/10.1007/s40747-023-01064-wSynonym set expansionSiamese networkBilateral contextFiltering strategy
spellingShingle Subin Huang
Yu Xiu
Jun Li
Sanmin Liu
Chao Kong
A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion
Complex & Intelligent Systems
Synonym set expansion
Siamese network
Bilateral context
Filtering strategy
title A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion
title_full A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion
title_fullStr A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion
title_full_unstemmed A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion
title_short A bilateral context and filtering strategy-based approach to Chinese entity synonym set expansion
title_sort bilateral context and filtering strategy based approach to chinese entity synonym set expansion
topic Synonym set expansion
Siamese network
Bilateral context
Filtering strategy
url https://doi.org/10.1007/s40747-023-01064-w
work_keys_str_mv AT subinhuang abilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT yuxiu abilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT junli abilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT sanminliu abilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT chaokong abilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT subinhuang bilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT yuxiu bilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT junli bilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT sanminliu bilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion
AT chaokong bilateralcontextandfilteringstrategybasedapproachtochineseentitysynonymsetexpansion