A Comprehensive Dataset of Spelling Errors and Users’ Corrections in Croatian Language

This paper presents a unique and extensive dataset containing over 33 million entries with pairs in the form “spelling error → correction” from ispravi.me, the most popular Croatian online spellchecking service, collected since 2008. The dataset, compiled from the contribution of nearly 900,000 user...

Bibliographic Details
Main Authors: Gordan Gledec, Marko Horvat, Miljenko Mikuc, Bruno Blašković
Format: Article
Language: English
Published: MDPI AG 2023-05-01
Series: Data
Subjects:
Online Access: https://www.mdpi.com/2306-5729/8/5/89