Pre-trained transformer-based language models for Sundanese

Abstract: The Sundanese language has over 32 million speakers worldwide, but it has reaped little to no benefit from recent advances in natural language understanding. As with other low-resource languages, the only alternative is to fine-tune existing multilingual models. In this paper, w...

Bibliographic Details
Main Authors: Wilson Wongso, Henry Lucky, Derwin Suhartono
Format: Article
Language: English
Published: SpringerOpen 2022-04-01
Series: Journal of Big Data
Subjects:
Online Access: https://doi.org/10.1186/s40537-022-00590-7