Machine Translation System for the Industry Domain and Croatian Language

Machine translation is increasingly becoming a hot research topic in information and communication sciences, computer science and computational linguistics, due to the fact that it enables communication and transferring of meaning across different languages. As the Croatian language can be considere...


Bibliographic Details
Main Author: Ivan Dunđer
Format: Article
Language: English
Published: University of Zagreb, Faculty of organization and informatics 2020-01-01
Series: Journal of Information and Organizational Sciences
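Subjects: statistical machine translation; domain adaptation; automatic quality metrics; human quality evaluation; error classification; Croatian language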
Online Access: https://hrcak.srce.hr/file/348061
_version_ 1827282493267509248
author Ivan Dunđer
collection DOAJ
description Machine translation is an increasingly active research topic in information and communication sciences, computer science and computational linguistics, because it enables communication and the transfer of meaning across languages. As the Croatian language can be considered low-resourced in terms of available services and technology, the development of new domain-specific machine translation systems is important, especially given the growing interest and needs of industry, academia and everyday users. Machine translation is not perfect, but it is crucial to ensure acceptable quality, which is purpose-dependent. In this research, several statistical machine translation systems were built, one of which used domain adaptation with the intention of improving the machine translation output. Afterwards, an extensive evaluation was performed by applying several automatic quality metrics and human evaluation focused on various aspects. The evaluation was carried out to assess the quality of specific machine-translated text.
first_indexed 2024-04-24T09:19:11Z
format Article
id doaj.art-4b95d5a565594568a54d285d732c9783
institution Directory Open Access Journal
issn 1846-3312
1846-9418
language English
last_indexed 2024-04-24T09:19:11Z
publishDate 2020-01-01
publisher University of Zagreb, Faculty of organization and informatics
record_format Article
series Journal of Information and Organizational Sciences
spelling doaj.art-4b95d5a565594568a54d285d732c9783 | 2024-04-15T16:16:49Z | eng | University of Zagreb, Faculty of organization and informatics | Journal of Information and Organizational Sciences | ISSN 1846-3312, 1846-9418 | 2020-01-01 | vol. 44, no. 1, pp. 33-50 | doi:10.31341/jios.44.1.2 | Machine Translation System for the Industry Domain and Croatian Language | Ivan Dunđer (Faculty of Humanities and Social Sciences, University of Zagreb, Zagreb, Croatia) | https://hrcak.srce.hr/file/348061 | statistical machine translation; domain adaptation; automatic quality metrics; human quality evaluation; error classification; Croatian language
title Machine Translation System for the Industry Domain and Croatian Language
topic statistical machine translation
domain adaptation
automatic quality metrics
human quality evaluation
error classification
Croatian language
url https://hrcak.srce.hr/file/348061