Cascade Speech Translation for the Kazakh Language

Speech translation systems have become indispensable in facilitating seamless communication across language barriers. This paper presents a cascade speech translation system tailored specifically for translating speech from the Kazakh language to Russian. The system aims to enable effective cross-li...

Full description

Bibliographic Details
Main Authors: Zhanibek Kozhirbayev, Talgat Islamgozhayev
Format: Article
Language:English
Published: MDPI AG 2023-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/15/8900
_version_ 1797587045808340992
author Zhanibek Kozhirbayev
Talgat Islamgozhayev
author_facet Zhanibek Kozhirbayev
Talgat Islamgozhayev
author_sort Zhanibek Kozhirbayev
collection DOAJ
description Speech translation systems have become indispensable in facilitating seamless communication across language barriers. This paper presents a cascade speech translation system tailored specifically for translating speech from the Kazakh language to Russian. The system aims to enable effective cross-lingual communication between Kazakh and Russian speakers, addressing the unique challenges posed by these languages. To develop the cascade speech translation system, we first created a dedicated speech translation dataset ST-kk-ru based on the ISSAI Corpus. The ST-kk-ru dataset comprises a large collection of Kazakh speech recordings along with their corresponding Russian translations. The automatic speech recognition (ASR) module of the system utilizes deep learning techniques to convert spoken Kazakh input into text. The machine translation (MT) module employs state-of-the-art neural machine translation methods, leveraging the parallel Kazakh-Russian translations available in the dataset to generate accurate translations. By conducting extensive experiments and evaluations, we have thoroughly assessed the performance of the cascade speech translation system on the ST-kk-ru dataset. The outcomes of our evaluation highlight the effectiveness of incorporating additional datasets for both the ASR and MT modules. This augmentation leads to a significant improvement in the performance of the cascade speech translation system, increasing the BLEU score by approximately 2 points when translating from Kazakh to Russian. These findings underscore the importance of leveraging supplementary data to enhance the capabilities of speech translation systems.
first_indexed 2024-03-11T00:31:40Z
format Article
id doaj.art-c9d0d6a9ee8246948641cb0e8bfaeff2
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T00:31:40Z
publishDate 2023-08-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-c9d0d6a9ee8246948641cb0e8bfaeff22023-11-18T22:39:01ZengMDPI AGApplied Sciences2076-34172023-08-011315890010.3390/app13158900Cascade Speech Translation for the Kazakh LanguageZhanibek Kozhirbayev0Talgat Islamgozhayev1National Laboratory Astana, Nazarbayev University, Astana 010000, KazakhstanNational Laboratory Astana, Nazarbayev University, Astana 010000, KazakhstanSpeech translation systems have become indispensable in facilitating seamless communication across language barriers. This paper presents a cascade speech translation system tailored specifically for translating speech from the Kazakh language to Russian. The system aims to enable effective cross-lingual communication between Kazakh and Russian speakers, addressing the unique challenges posed by these languages. To develop the cascade speech translation system, we first created a dedicated speech translation dataset ST-kk-ru based on the ISSAI Corpus. The ST-kk-ru dataset comprises a large collection of Kazakh speech recordings along with their corresponding Russian translations. The automatic speech recognition (ASR) module of the system utilizes deep learning techniques to convert spoken Kazakh input into text. The machine translation (MT) module employs state-of-the-art neural machine translation methods, leveraging the parallel Kazakh-Russian translations available in the dataset to generate accurate translations. By conducting extensive experiments and evaluations, we have thoroughly assessed the performance of the cascade speech translation system on the ST-kk-ru dataset. The outcomes of our evaluation highlight the effectiveness of incorporating additional datasets for both the ASR and MT modules. This augmentation leads to a significant improvement in the performance of the cascade speech translation system, increasing the BLEU score by approximately 2 points when translating from Kazakh to Russian. These findings underscore the importance of leveraging supplementary data to enhance the capabilities of speech translation systems.https://www.mdpi.com/2076-3417/13/15/8900cascade speech translationKazakh languageRussian languageautomatic speech recognitionmachine translationcross-lingual communication
spellingShingle Zhanibek Kozhirbayev
Talgat Islamgozhayev
Cascade Speech Translation for the Kazakh Language
Applied Sciences
cascade speech translation
Kazakh language
Russian language
automatic speech recognition
machine translation
cross-lingual communication
title Cascade Speech Translation for the Kazakh Language
title_full Cascade Speech Translation for the Kazakh Language
title_fullStr Cascade Speech Translation for the Kazakh Language
title_full_unstemmed Cascade Speech Translation for the Kazakh Language
title_short Cascade Speech Translation for the Kazakh Language
title_sort cascade speech translation for the kazakh language
topic cascade speech translation
Kazakh language
Russian language
automatic speech recognition
machine translation
cross-lingual communication
url https://www.mdpi.com/2076-3417/13/15/8900
work_keys_str_mv AT zhanibekkozhirbayev cascadespeechtranslationforthekazakhlanguage
AT talgatislamgozhayev cascadespeechtranslationforthekazakhlanguage