Classifying cuneiform symbols using machine learning algorithms with unigram features on a balanced dataset
Recognizing written languages using symbols written in cuneiform is a tough endeavor due to the lack of information and the challenge of the process of tokenization. The Cuneiform Language Identification (CLI) dataset attempts to understand seven cuneiform languages and dialects, including Sumerian...
Main Authors: | Mahmood Maha, Jasem Farah Maath, Mukhlif Abdulrahman Abbas, AL-Khateeb Belal |
---|---|
Format: | Article |
Language: | English |
Published: |
De Gruyter
2023-09-01
|
Series: | Journal of Intelligent Systems |
Subjects: | |
Online Access: | https://doi.org/10.1515/jisys-2023-0087 |
Similar Items
-
Extensive Review of State-of-the-Art Classification Techniques for Cuneiform Symbol Imaging: Open Issues and Challenges
by: Farah Maath, et al.
Published: (2023-08-01) -
ENIGMA: an enterotype-like unigram mixture model for microbial association analysis
by: Ko Abe, et al.
Published: (2019-04-01) -
Bipartite medial cuneiform: a potential cause of midfoot pain - case report
by: Thales Augusto de Santa Helena Ilha, et al.
Published: (2023-10-01) -
Comparison of Unigram, HMM, CRF and Brill's Part-of-Speech Taggers Available in NLTK Library
by: Michal Kvet, et al.
Published: (2023-05-01) -
Morphological characterization of Maize (Zea mays.) utilising the stage-wise structural and architectural perspective from temporal fully-polarimetric SAR
by: Dipanwita Haldar, et al.
Published: (2024-06-01)