Recurrent Neural Network-Gated Recurrent Unit for Indonesia-Sentani Papua Machine Translation

The Papuan Sentani language is spoken in the city of Jayapura, Papua. The law states the need to preserve regional languages. One of them is by building an Indonesian-Sentani Papua translation machine. The problem is how to build a translation machine and what model to choose in doing so. The model...

Full description

Bibliographic Details
Main Authors: Rizkial Achmad, Yokelin Tokoro, Jusuf Haurissa, Andik Wijanarko
Format: Article
Language:English
Published: Informatics Department, Faculty of Computer Science Bina Darma University 2023-12-01
Series:Journal of Information Systems and Informatics
Subjects:
Online Access:https://journal-isi.org/index.php/isi/article/view/597
Description
Summary:The Papuan Sentani language is spoken in the city of Jayapura, Papua. The law states the need to preserve regional languages. One of them is by building an Indonesian-Sentani Papua translation machine. The problem is how to build a translation machine and what model to choose in doing so. The model chosen is Recurrent Neural Network – Gated Recurrent Units (RNN-GRU) which has been widely used to build regional languages in Indonesia. The method used is an experiment starting from creating a parallel corpus, followed by corpus training using the RNN-GRU model, and the final step is conducting an evaluation using Bilingual Evaluation Understudy (BLEU) to find out the score. The parallel corpus used contains 281 sentences, each sentence has an average length of 8 words. The training time required is 3 hours without using a GPU. The result of this research was that a fairly good BLEU score was obtained, namely 35.3, which means that the RNN-GRU model and parallel corpus produced sufficient translation quality and could still be improved.
ISSN:2656-5935
2656-4882