A Sememe Prediction Method Based on the Central Word of a Semantic Field

A “sememe” is an indivisible minimal unit of meaning in linguistics. Manually annotating sememes in words requires a significant amount of time, so automated sememe prediction is often used to improve efficiency. Semantic fields serve as crucial mediators connecting the semantics between words. This...

Full description

Bibliographic Details
Main Authors: Guanran Luo, Yunpeng Cui
Format: Article
Language:English
Published: MDPI AG 2024-01-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/13/2/413
_version_ 1797344187053506560
author Guanran Luo
Yunpeng Cui
author_facet Guanran Luo
Yunpeng Cui
author_sort Guanran Luo
collection DOAJ
description A “sememe” is an indivisible minimal unit of meaning in linguistics. Manually annotating sememes in words requires a significant amount of time, so automated sememe prediction is often used to improve efficiency. Semantic fields serve as crucial mediators connecting the semantics between words. This paper proposes an unsupervised method for sememe prediction based on the common semantics between words and semantic fields. In comparison to methods based on word vectors, this approach demonstrates a superior ability to align the semantics of words and sememes. We construct various types of semantic fields through ChatGPT and design a semantic field selection strategy to adapt to different scenario requirements. Subsequently, following the order of word–sense–sememe, we decompose the process of calculating the semantic sememe similarity between semantic fields and target words. Finally, we select the word with the highest average semantic sememe similarity as the central word of the semantic field, using its semantic primes as the predicted result. On the BabelSememe dataset constructed based on the sememe knowledge base HowNet, the method of semantic field central word (SFCW) achieved the best results for both unstructured and structured sememe prediction tasks, demonstrating the effectiveness of this approach. Additionally, we conducted qualitative and quantitative analyses on the sememe structure of the central word.
first_indexed 2024-03-08T10:59:41Z
format Article
id doaj.art-f263470bb1d64c61b0cd2113d0e29bdc
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-08T10:59:41Z
publishDate 2024-01-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-f263470bb1d64c61b0cd2113d0e29bdc2024-01-26T16:14:45ZengMDPI AGElectronics2079-92922024-01-0113241310.3390/electronics13020413A Sememe Prediction Method Based on the Central Word of a Semantic FieldGuanran Luo0Yunpeng Cui1Agriculture Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, ChinaAgriculture Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, ChinaA “sememe” is an indivisible minimal unit of meaning in linguistics. Manually annotating sememes in words requires a significant amount of time, so automated sememe prediction is often used to improve efficiency. Semantic fields serve as crucial mediators connecting the semantics between words. This paper proposes an unsupervised method for sememe prediction based on the common semantics between words and semantic fields. In comparison to methods based on word vectors, this approach demonstrates a superior ability to align the semantics of words and sememes. We construct various types of semantic fields through ChatGPT and design a semantic field selection strategy to adapt to different scenario requirements. Subsequently, following the order of word–sense–sememe, we decompose the process of calculating the semantic sememe similarity between semantic fields and target words. Finally, we select the word with the highest average semantic sememe similarity as the central word of the semantic field, using its semantic primes as the predicted result. On the BabelSememe dataset constructed based on the sememe knowledge base HowNet, the method of semantic field central word (SFCW) achieved the best results for both unstructured and structured sememe prediction tasks, demonstrating the effectiveness of this approach. Additionally, we conducted qualitative and quantitative analyses on the sememe structure of the central word.https://www.mdpi.com/2079-9292/13/2/413sememe predictionsemantic fieldHowNet
spellingShingle Guanran Luo
Yunpeng Cui
A Sememe Prediction Method Based on the Central Word of a Semantic Field
Electronics
sememe prediction
semantic field
HowNet
title A Sememe Prediction Method Based on the Central Word of a Semantic Field
title_full A Sememe Prediction Method Based on the Central Word of a Semantic Field
title_fullStr A Sememe Prediction Method Based on the Central Word of a Semantic Field
title_full_unstemmed A Sememe Prediction Method Based on the Central Word of a Semantic Field
title_short A Sememe Prediction Method Based on the Central Word of a Semantic Field
title_sort sememe prediction method based on the central word of a semantic field
topic sememe prediction
semantic field
HowNet
url https://www.mdpi.com/2079-9292/13/2/413
work_keys_str_mv AT guanranluo asememepredictionmethodbasedonthecentralwordofasemanticfield
AT yunpengcui asememepredictionmethodbasedonthecentralwordofasemanticfield
AT guanranluo sememepredictionmethodbasedonthecentralwordofasemanticfield
AT yunpengcui sememepredictionmethodbasedonthecentralwordofasemanticfield