English-to-Chinese transliteration with phonetic auxiliary task

Approaching named entities transliteration as a Neural Machine Translation (NMT) problem is common practice. While many have applied various NMT techniques to enhance machine transliteration models, few focus on the linguistic features particular to the relevant languages. In this paper, we investig...

Full description

Bibliographic Details
Main Authors: He, Y, Cohen, SB
Format: Conference item
Language:English
Published: Association for Computational Linguistics 2020
_version_ 1826309413471780864
author He, Y
Cohen, SB
author_facet He, Y
Cohen, SB
author_sort He, Y
collection OXFORD
description Approaching named entities transliteration as a Neural Machine Translation (NMT) problem is common practice. While many have applied various NMT techniques to enhance machine transliteration models, few focus on the linguistic features particular to the relevant languages. In this paper, we investigate the effect of incorporating phonetic features for English-to-Chinese transliteration under the multi-task learning (MTL) setting—where we define a phonetic auxiliary task aimed to improve the generalization performance of the main transliteration task. In addition to our system, we also release a new English-to-Chinese dataset and propose a novel evaluation metric which considers multiple possible transliterations given a source name. Our results show that the multi-task model achieves similar performance as the previous state of the art with a model of a much smaller size.
first_indexed 2024-03-07T07:33:50Z
format Conference item
id oxford-uuid:bbefdf16-c939-4665-8e3f-9bc9312e6152
institution University of Oxford
language English
last_indexed 2024-03-07T07:33:50Z
publishDate 2020
publisher Association for Computational Linguistics
record_format dspace
spelling oxford-uuid:bbefdf16-c939-4665-8e3f-9bc9312e61522023-02-20T10:36:18ZEnglish-to-Chinese transliteration with phonetic auxiliary taskConference itemhttp://purl.org/coar/resource_type/c_5794uuid:bbefdf16-c939-4665-8e3f-9bc9312e6152EnglishSymplectic ElementsAssociation for Computational Linguistics2020He, YCohen, SBApproaching named entities transliteration as a Neural Machine Translation (NMT) problem is common practice. While many have applied various NMT techniques to enhance machine transliteration models, few focus on the linguistic features particular to the relevant languages. In this paper, we investigate the effect of incorporating phonetic features for English-to-Chinese transliteration under the multi-task learning (MTL) setting—where we define a phonetic auxiliary task aimed to improve the generalization performance of the main transliteration task. In addition to our system, we also release a new English-to-Chinese dataset and propose a novel evaluation metric which considers multiple possible transliterations given a source name. Our results show that the multi-task model achieves similar performance as the previous state of the art with a model of a much smaller size.
spellingShingle He, Y
Cohen, SB
English-to-Chinese transliteration with phonetic auxiliary task
title English-to-Chinese transliteration with phonetic auxiliary task
title_full English-to-Chinese transliteration with phonetic auxiliary task
title_fullStr English-to-Chinese transliteration with phonetic auxiliary task
title_full_unstemmed English-to-Chinese transliteration with phonetic auxiliary task
title_short English-to-Chinese transliteration with phonetic auxiliary task
title_sort english to chinese transliteration with phonetic auxiliary task
work_keys_str_mv AT hey englishtochinesetransliterationwithphoneticauxiliarytask
AT cohensb englishtochinesetransliterationwithphoneticauxiliarytask