ARCHITECTURE OF THE MULTIVOICE TEXT-TO-SPEECH SYSTEM
An architecture for a multivoice text-to-speech synthesis system based on a voice conversion framework is proposed. Such a system can be tuned to a specific speaker at no additional cost in the training phase, using the single-speaker database already available in the TTS system. Structural scheme for this type...
Main Authors: | V. A. Zakharyeu, A. A. Petrovsky |
---|---|
Format: | Article |
Language: | Russian |
Published: | Educational institution «Belarusian State University of Informatics and Radioelectronics», 2019-06-01 |
Series: | Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioèlektroniki |
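Subjects: | voice conversion; multivoice text-to-speech synthesizer; text-independent training; hidden Markov model; parametric signal representation model |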
Online Access: | https://doklady.bsuir.by/jour/article/view/239 |
_version_ | 1797881201580572672 |
---|---|
author | V. A. Zakharyeu, A. A. Petrovsky |
collection | DOAJ |
description | An architecture for a multivoice text-to-speech synthesis system based on a voice conversion framework is proposed. Such a system can be tuned to a specific speaker at no additional cost in the training phase, using the single-speaker database already available in the TTS system. A structural scheme for this type of speech synthesizer is presented, along with a description of the functionality of its main blocks. Its distinguishing characteristics are a synergetic approach to the architecture and a text-independent mode in the training phase. |
first_indexed | 2024-04-10T03:16:19Z |
format | Article |
id | doaj.art-2cc57912790e47f0a59555ce607ea392 |
institution | Directory Open Access Journal |
issn | 1729-7648 |
language | Russian |
last_indexed | 2024-04-10T03:16:19Z |
publishDate | 2019-06-01 |
publisher | Educational institution «Belarusian State University of Informatics and Radioelectronics» |
record_format | Article |
series | Doklady Belorusskogo gosudarstvennogo universiteta informatiki i radioèlektroniki |
title | ARCHITECTURE OF THE MULTIVOICE TEXT-TO-SPEECH SYSTEM |
topic | voice conversion; multivoice text-to-speech synthesizer; text-independent training; hidden Markov model; parametric signal representation model |
url | https://doklady.bsuir.by/jour/article/view/239 |