MiNgMatch—A Fast N-gram Model for Word Segmentation of the Ainu Language

Word segmentation is an essential task in automatic language processing for languages where there are no explicit word boundary markers, or where space-delimited orthographic words are too coarse-grained. In this paper we introduce the MiNgMatch Segmenter—a fast word segmentation algorithm...

Full description

Bibliographic Details
Main Authors: Karol Nowakowski, Michal Ptaszynski, Fumito Masui
Format: Article
Language:English
Published: MDPI AG 2019-10-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/10/10/317