A Sememe Prediction Method Based on the Central Word of a Semantic Field
A “sememe” is an indivisible minimal unit of meaning in linguistics. Manually annotating sememes in words requires a significant amount of time, so automated sememe prediction is often used to improve efficiency. Semantic fields serve as crucial mediators connecting the semantics between words. This...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-01-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/13/2/413 |
_version_ | 1797344187053506560 |
---|---|
author | Guanran Luo Yunpeng Cui |
author_facet | Guanran Luo Yunpeng Cui |
author_sort | Guanran Luo |
collection | DOAJ |
description | A “sememe” is an indivisible minimal unit of meaning in linguistics. Manually annotating sememes in words requires a significant amount of time, so automated sememe prediction is often used to improve efficiency. Semantic fields serve as crucial mediators connecting the semantics between words. This paper proposes an unsupervised method for sememe prediction based on the common semantics between words and semantic fields. In comparison to methods based on word vectors, this approach demonstrates a superior ability to align the semantics of words and sememes. We construct various types of semantic fields through ChatGPT and design a semantic field selection strategy to adapt to different scenario requirements. Subsequently, following the order of word–sense–sememe, we decompose the process of calculating the semantic sememe similarity between semantic fields and target words. Finally, we select the word with the highest average semantic sememe similarity as the central word of the semantic field, using its semantic primes as the predicted result. On the BabelSememe dataset constructed based on the sememe knowledge base HowNet, the method of semantic field central word (SFCW) achieved the best results for both unstructured and structured sememe prediction tasks, demonstrating the effectiveness of this approach. Additionally, we conducted qualitative and quantitative analyses on the sememe structure of the central word. |
first_indexed | 2024-03-08T10:59:41Z |
format | Article |
id | doaj.art-f263470bb1d64c61b0cd2113d0e29bdc |
institution | Directory Open Access Journal |
issn | 2079-9292 |
language | English |
last_indexed | 2024-03-08T10:59:41Z |
publishDate | 2024-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Electronics |
spelling | doaj.art-f263470bb1d64c61b0cd2113d0e29bdc2024-01-26T16:14:45ZengMDPI AGElectronics2079-92922024-01-0113241310.3390/electronics13020413A Sememe Prediction Method Based on the Central Word of a Semantic FieldGuanran Luo0Yunpeng Cui1Agriculture Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, ChinaAgriculture Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, ChinaA “sememe” is an indivisible minimal unit of meaning in linguistics. Manually annotating sememes in words requires a significant amount of time, so automated sememe prediction is often used to improve efficiency. Semantic fields serve as crucial mediators connecting the semantics between words. This paper proposes an unsupervised method for sememe prediction based on the common semantics between words and semantic fields. In comparison to methods based on word vectors, this approach demonstrates a superior ability to align the semantics of words and sememes. We construct various types of semantic fields through ChatGPT and design a semantic field selection strategy to adapt to different scenario requirements. Subsequently, following the order of word–sense–sememe, we decompose the process of calculating the semantic sememe similarity between semantic fields and target words. Finally, we select the word with the highest average semantic sememe similarity as the central word of the semantic field, using its semantic primes as the predicted result. On the BabelSememe dataset constructed based on the sememe knowledge base HowNet, the method of semantic field central word (SFCW) achieved the best results for both unstructured and structured sememe prediction tasks, demonstrating the effectiveness of this approach. Additionally, we conducted qualitative and quantitative analyses on the sememe structure of the central word.https://www.mdpi.com/2079-9292/13/2/413sememe predictionsemantic fieldHowNet |
spellingShingle | Guanran Luo Yunpeng Cui A Sememe Prediction Method Based on the Central Word of a Semantic Field Electronics sememe prediction semantic field HowNet |
title | A Sememe Prediction Method Based on the Central Word of a Semantic Field |
title_full | A Sememe Prediction Method Based on the Central Word of a Semantic Field |
title_fullStr | A Sememe Prediction Method Based on the Central Word of a Semantic Field |
title_full_unstemmed | A Sememe Prediction Method Based on the Central Word of a Semantic Field |
title_short | A Sememe Prediction Method Based on the Central Word of a Semantic Field |
title_sort | sememe prediction method based on the central word of a semantic field |
topic | sememe prediction semantic field HowNet |
url | https://www.mdpi.com/2079-9292/13/2/413 |
work_keys_str_mv | AT guanranluo asememepredictionmethodbasedonthecentralwordofasemanticfield AT yunpengcui asememepredictionmethodbasedonthecentralwordofasemanticfield AT guanranluo sememepredictionmethodbasedonthecentralwordofasemanticfield AT yunpengcui sememepredictionmethodbasedonthecentralwordofasemanticfield |