Corpus-based analysis of semi-automatically extracted artificial intelligence-related terminology

Artificial Intelligence (AI), as a multidisciplinary field, combines computer science, robotics and cognitive science, with growing applications in many diverse areas, such as engineering, business, medicine, weather forecasting, industry, translation, natural language, linguistics, etc...

Bibliographic Details
Main Authors: Mikelionienė Jurgita, Motiejūnienė Jurgita
Format: Article
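Language: English
Published: Sciendo 2021-03-01
Series: Journal of Language and Cultural Education
Subjects: artificial intelligence, domain-specific corpus, semi-automatic term extraction, terminology, collocates
Online Access: https://doi.org/10.2478/jolace-2021-0003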
author Mikelionienė Jurgita
Motiejūnienė Jurgita
collection DOAJ
description Artificial Intelligence (AI), as a multidisciplinary field, combines computer science, robotics and cognitive science, with growing applications in many diverse areas, such as engineering, business, medicine, weather forecasting, industry, translation, natural language, linguistics, etc. In Europe, interest in AI has been rising over the last decade. One of the greatest hurdles for researchers in automated processing of technical documentation is the large amount of specific terminology. The aim of this research is to analyse the semi-automatically extracted artificial intelligence-related terminology and the most common phrases related to artificial intelligence in English and Lithuanian in terms of their structure, multidisciplinarity and connotation. Two programmes, SynchroTerm and SketchEngine, were chosen for the selection and analysis of terms. The paper presents the outcomes of an AI terminological project carried out with SynchroTerm and provides an analysis of a special corpus compiled in the field of artificial intelligence using the SketchEngine platform. The analysis of semi-automatic term extraction and corpus-based techniques for artificial intelligence-related terminology revealed that AI as a specialized domain contains multidisciplinary terminology and is complex and dynamic. The empirical data show that context is essential for evaluating the concept under analysis and that it reveals the different connotations of the term.
first_indexed 2024-12-10T20:39:56Z
format Article
id doaj.art-698b9f42403e4f51a928c4f1a87c722d
institution Directory of Open Access Journals
issn 1339-4584
language English
last_indexed 2024-12-10T20:39:56Z
publishDate 2021-03-01
publisher Sciendo
record_format Article
series Journal of Language and Cultural Education
spelling doaj.art-698b9f42403e4f51a928c4f1a87c722d
2022-12-22T01:34:23Z
eng
Sciendo
Journal of Language and Cultural Education
1339-4584
2021-03-01
Volume 9, issue 1, pages 30-38
10.2478/jolace-2021-0003
Corpus-based analysis of semi-automatically extracted artificial intelligence-related terminology
Mikelionienė Jurgita (Kaunas University of Technology, Lithuania)
Motiejūnienė Jurgita (Kaunas University of Technology, Lithuania)
Artificial Intelligence (AI), as a multidisciplinary field, combines computer science, robotics and cognitive science, with growing applications in many diverse areas, such as engineering, business, medicine, weather forecasting, industry, translation, natural language, linguistics, etc. In Europe, interest in AI has been rising over the last decade. One of the greatest hurdles for researchers in automated processing of technical documentation is the large amount of specific terminology. The aim of this research is to analyse the semi-automatically extracted artificial intelligence-related terminology and the most common phrases related to artificial intelligence in English and Lithuanian in terms of their structure, multidisciplinarity and connotation. Two programmes, SynchroTerm and SketchEngine, were chosen for the selection and analysis of terms. The paper presents the outcomes of an AI terminological project carried out with SynchroTerm and provides an analysis of a special corpus compiled in the field of artificial intelligence using the SketchEngine platform. The analysis of semi-automatic term extraction and corpus-based techniques for artificial intelligence-related terminology revealed that AI as a specialized domain contains multidisciplinary terminology and is complex and dynamic. The empirical data show that context is essential for evaluating the concept under analysis and that it reveals the different connotations of the term.
https://doi.org/10.2478/jolace-2021-0003
artificial intelligence
domain-specific corpus
semi-automatic term extraction
terminology
collocates
title Corpus-based analysis of semi-automatically extracted artificial intelligence-related terminology
topic artificial intelligence
domain-specific corpus
semi-automatic term extraction
terminology
collocates
url https://doi.org/10.2478/jolace-2021-0003