Semantic pattern recognition of special English words based on data corpus technology

This paper first researches special English vocabulary teaching defines special English vocabulary definitions and characteristics based on special English usage, and proposes strategies for learning special English vocabulary. The study involves studying the semantic recognition of special English...

Full description

Bibliographic Details
Main Author: He Qingxia
Format: Article
Language:English
Published: Sciendo 2024-01-01
Series:Applied Mathematics and Nonlinear Sciences
Subjects:
Online Access:https://doi.org/10.2478/amns.2023.2.01037
_version_ 1797340633111724032
author He Qingxia
author_facet He Qingxia
author_sort He Qingxia
collection DOAJ
description This paper first researches special English vocabulary teaching defines special English vocabulary definitions and characteristics based on special English usage, and proposes strategies for learning special English vocabulary. The study involves studying the semantic recognition of special English using data corpus technology. Based on data corpus technology, the semantic structure of English vocabulary is analyzed according to Chinese analysis and word vectors, and to facilitate the extraction of English semantic features, IOB2 labeled training corpus is needed. Then, the CRFs network algorithm in the data corpus technology is used to construct the special English semantic recognition model, the lexical annotation set is refined on the preprocessing, and at the same time, the semantic features are added to the recognition model of CRFs mainly used to recognize the noun phrases of gerund class and the experimental analysis is carried out on the recognition of the semantic patterns of the special English vocabulary. The results show that among high school students with high English semantic recognition ability, the correlation coefficient between their aggregates and reading level is 0.753, the correlation coefficient between collocated phrases and reading level is 0.772, and there is a high correlation between the ability to recognize aggregates, collocated phrases, and sentence framing phrases and reading level. This study is designed to improve the efficiency of English reading and communication and has a facilitatory effect on recognizing English lexical semantic patterns.
first_indexed 2024-03-08T10:05:58Z
format Article
id doaj.art-ab267592d49842729fd5f85c072455e6
institution Directory Open Access Journal
issn 2444-8656
language English
last_indexed 2024-03-08T10:05:58Z
publishDate 2024-01-01
publisher Sciendo
record_format Article
series Applied Mathematics and Nonlinear Sciences
spelling doaj.art-ab267592d49842729fd5f85c072455e62024-01-29T08:52:38ZengSciendoApplied Mathematics and Nonlinear Sciences2444-86562024-01-019110.2478/amns.2023.2.01037Semantic pattern recognition of special English words based on data corpus technologyHe Qingxia01School of Foreign Languages, Sanjiang University, Nanjing, Jiangsu, 210012, China.This paper first researches special English vocabulary teaching defines special English vocabulary definitions and characteristics based on special English usage, and proposes strategies for learning special English vocabulary. The study involves studying the semantic recognition of special English using data corpus technology. Based on data corpus technology, the semantic structure of English vocabulary is analyzed according to Chinese analysis and word vectors, and to facilitate the extraction of English semantic features, IOB2 labeled training corpus is needed. Then, the CRFs network algorithm in the data corpus technology is used to construct the special English semantic recognition model, the lexical annotation set is refined on the preprocessing, and at the same time, the semantic features are added to the recognition model of CRFs mainly used to recognize the noun phrases of gerund class and the experimental analysis is carried out on the recognition of the semantic patterns of the special English vocabulary. The results show that among high school students with high English semantic recognition ability, the correlation coefficient between their aggregates and reading level is 0.753, the correlation coefficient between collocated phrases and reading level is 0.772, and there is a high correlation between the ability to recognize aggregates, collocated phrases, and sentence framing phrases and reading level. This study is designed to improve the efficiency of English reading and communication and has a facilitatory effect on recognizing English lexical semantic patterns.https://doi.org/10.2478/amns.2023.2.01037crfs recognition modelword vectorsiob2 annotationdata corpus technologyspecial english vocabularylexical semantics97c50
spellingShingle He Qingxia
Semantic pattern recognition of special English words based on data corpus technology
Applied Mathematics and Nonlinear Sciences
crfs recognition model
word vectors
iob2 annotation
data corpus technology
special english vocabulary
lexical semantics
97c50
title Semantic pattern recognition of special English words based on data corpus technology
title_full Semantic pattern recognition of special English words based on data corpus technology
title_fullStr Semantic pattern recognition of special English words based on data corpus technology
title_full_unstemmed Semantic pattern recognition of special English words based on data corpus technology
title_short Semantic pattern recognition of special English words based on data corpus technology
title_sort semantic pattern recognition of special english words based on data corpus technology
topic crfs recognition model
word vectors
iob2 annotation
data corpus technology
special english vocabulary
lexical semantics
97c50
url https://doi.org/10.2478/amns.2023.2.01037
work_keys_str_mv AT heqingxia semanticpatternrecognitionofspecialenglishwordsbasedondatacorpustechnology