Towards Machine-Readable Lexicons for South African Bantu languages

Bibliographic Details
Main Authors: Sonja E. Bosch, Laurette Pretorius, Jackie Jones
Format: Article
Language: English
Published: Nordic Africa Research Network 2007-06-01
Series: Nordic Journal of African Studies
Online Access: https://www.njas.fi/njas/article/view/62
author Sonja E. Bosch
Laurette Pretorius
Jackie Jones
collection DOAJ
description Lexical information for South African Bantu languages is not readily available in the form of machine-readable lexicons. At present, the availability of lexical information is restricted to a variety of paper dictionaries. These dictionaries display considerable diversity in the organisation and representation of data. In order to proceed towards the development of reusable and suitably standardised machine-readable lexicons for these languages, a data model for lexical entries becomes a prerequisite. In this study, the general-purpose model developed by Bell and Bird (2000) is used as a point of departure. Firstly, the extent to which the Bell and Bird (2000) data model may be applied to and modified for the above-mentioned languages is investigated. Initial investigations indicate that modification of this data model is necessary to make provision for the specific requirements of lexical entries in these languages. Secondly, a data model in the form of an XML DTD for the languages in question, based on our findings regarding Bell and Bird (2000) and Weber (2002), is presented. Included in this model are additional particular requirements for complete and appropriate representation of linguistic information as identified in the study of available paper dictionaries.
first_indexed 2024-03-12T03:26:26Z
format Article
id doaj.art-e11e66fb0f8f4a9486ecc2dea5f3d3c6
institution Directory Open Access Journal
issn 1459-9465
language English
last_indexed 2024-03-12T03:26:26Z
publishDate 2007-06-01
publisher Nordic Africa Research Network
record_format Article
series Nordic Journal of African Studies
volume 16
issue 2
doi 10.53228/njas.v16i2.62
title Towards Machine-Readable Lexicons for South African Bantu languages
url https://www.njas.fi/njas/article/view/62