A Data Driven Approach for Raw Material Terminology
The research presented in this paper aims at creating a bilingual (sr-en), easily searchable, hypertext, born-digital, corpus-based terminological database of raw material terminology for dictionary production. The approach is based on linking dictionaries related to the raw material domain, both di...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-03-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/11/7/2892 |
_version_ | 1797540261855756288 |
---|---|
author | Olivera Kitanović Ranka Stanković Aleksandra Tomašević Mihailo Škorić Ivan Babić Ljiljana Kolonja |
author_facet | Olivera Kitanović Ranka Stanković Aleksandra Tomašević Mihailo Škorić Ivan Babić Ljiljana Kolonja |
author_sort | Olivera Kitanović |
collection | DOAJ |
description | The research presented in this paper aims at creating a bilingual (sr-en), easily searchable, hypertext, born-digital, corpus-based terminological database of raw material terminology for dictionary production. The approach is based on linking dictionaries related to the raw material domain, both digitally born and printed, into a lexicon structure, aligning terminology from different dictionaries as much as possible. This paper presents the main features of this approach, data used for compilation of the terminological database, the procedure by which it has been generated and a mobile application for its use. Available (terminological) resources will be presented—paper dictionaries and digital resources related to the raw material domain, as well as general lexica morphological dictionaries. Resource preparation started with dictionary (retro)digitisation and corpora enlargement, followed by adding new Serbian terms to general lexica dictionaries, as well as adding bilingual terms. Dictionary development is relying on corpus analysis, details of which are also presented. Usage examples, collocations and concordances play an important role in raw material terminology, and have also been included in this research. Some important related issues discussed are collocation extraction methods, the use of domain labels, lexical and semantic relations, definitions and subentries. |
first_indexed | 2024-03-10T12:58:34Z |
format | Article |
id | doaj.art-ea6865ac2b0d458cb858dfe4b35beabe |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-10T12:58:34Z |
publishDate | 2021-03-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-ea6865ac2b0d458cb858dfe4b35beabe2023-11-21T11:46:10ZengMDPI AGApplied Sciences2076-34172021-03-01117289210.3390/app11072892A Data Driven Approach for Raw Material TerminologyOlivera Kitanović0Ranka Stanković1Aleksandra Tomašević2Mihailo Škorić3Ivan Babić4Ljiljana Kolonja5Faculty of Mining and Geology, University of Belgrade, 11000 Belgrade, SerbiaFaculty of Mining and Geology, University of Belgrade, 11000 Belgrade, SerbiaFaculty of Mining and Geology, University of Belgrade, 11000 Belgrade, SerbiaFaculty of Mining and Geology, University of Belgrade, 11000 Belgrade, SerbiaDepartment for Informatics and Computing, University of Criminal Investigation and Police Studies, 11000 Belgrade, SerbiaFaculty of Mining and Geology, University of Belgrade, 11000 Belgrade, SerbiaThe research presented in this paper aims at creating a bilingual (sr-en), easily searchable, hypertext, born-digital, corpus-based terminological database of raw material terminology for dictionary production. The approach is based on linking dictionaries related to the raw material domain, both digitally born and printed, into a lexicon structure, aligning terminology from different dictionaries as much as possible. This paper presents the main features of this approach, data used for compilation of the terminological database, the procedure by which it has been generated and a mobile application for its use. Available (terminological) resources will be presented—paper dictionaries and digital resources related to the raw material domain, as well as general lexica morphological dictionaries. Resource preparation started with dictionary (retro)digitisation and corpora enlargement, followed by adding new Serbian terms to general lexica dictionaries, as well as adding bilingual terms. Dictionary development is relying on corpus analysis, details of which are also presented. Usage examples, collocations and concordances play an important role in raw material terminology, and have also been included in this research. Some important related issues discussed are collocation extraction methods, the use of domain labels, lexical and semantic relations, definitions and subentries.https://www.mdpi.com/2076-3417/11/7/2892raw materialminingterminologydictionaryterminology applicationmobile application |
spellingShingle | Olivera Kitanović Ranka Stanković Aleksandra Tomašević Mihailo Škorić Ivan Babić Ljiljana Kolonja A Data Driven Approach for Raw Material Terminology Applied Sciences raw material mining terminology dictionary terminology application mobile application |
title | A Data Driven Approach for Raw Material Terminology |
title_full | A Data Driven Approach for Raw Material Terminology |
title_fullStr | A Data Driven Approach for Raw Material Terminology |
title_full_unstemmed | A Data Driven Approach for Raw Material Terminology |
title_short | A Data Driven Approach for Raw Material Terminology |
title_sort | data driven approach for raw material terminology |
topic | raw material mining terminology dictionary terminology application mobile application |
url | https://www.mdpi.com/2076-3417/11/7/2892 |
work_keys_str_mv | AT oliverakitanovic adatadrivenapproachforrawmaterialterminology AT rankastankovic adatadrivenapproachforrawmaterialterminology AT aleksandratomasevic adatadrivenapproachforrawmaterialterminology AT mihailoskoric adatadrivenapproachforrawmaterialterminology AT ivanbabic adatadrivenapproachforrawmaterialterminology AT ljiljanakolonja adatadrivenapproachforrawmaterialterminology AT oliverakitanovic datadrivenapproachforrawmaterialterminology AT rankastankovic datadrivenapproachforrawmaterialterminology AT aleksandratomasevic datadrivenapproachforrawmaterialterminology AT mihailoskoric datadrivenapproachforrawmaterialterminology AT ivanbabic datadrivenapproachforrawmaterialterminology AT ljiljanakolonja datadrivenapproachforrawmaterialterminology |