Exploring West African Folk Narrative Texts Using Machine Learning

This paper examines how machine learning (ML) and natural language processing (NLP) can be used to identify, analyze, and generate West African folk tales. Two corpora of West African and Western European folk tales are compiled and used in three experiments on cross-cultural folk tale analysis. In...

Full description

Bibliographic Details
Main Authors: Gossa Lô, Victor de Boer, Chris J. van Aart
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/11/5/236
_version_ 1797569586879528960
author Gossa Lô
Victor de Boer
Chris J. van Aart
author_facet Gossa Lô
Victor de Boer
Chris J. van Aart
author_sort Gossa Lô
collection DOAJ
description This paper examines how machine learning (ML) and natural language processing (NLP) can be used to identify, analyze, and generate West African folk tales. Two corpora of West African and Western European folk tales are compiled and used in three experiments on cross-cultural folk tale analysis. In the text generation experiment, two types of deep learning text generators are built and trained on the West African corpus. We show that although the texts range between semantic and syntactic coherence, each of them contains West African features. The second experiment further examines the distinction between the West African and Western European folk tales by comparing the performance of an LSTM (acc. 0.79) with a BoW classifier (acc. 0.93), indicating that the two corpora can be clearly distinguished in terms of vocabulary. An interactive t-SNE visualization of a hybrid classifier (acc. 0.85) highlights the culture-specific words for both. The third experiment describes an ML analysis of narrative structures. Classifiers trained on parts of folk tales according to the three-act structure are quite capable of distinguishing these parts (acc. 0.78). Common n-grams extracted from these parts not only underline cross-cultural distinctions in narrative structures, but also show the overlap between verbal and written West African narratives.
first_indexed 2024-03-10T20:13:48Z
format Article
id doaj.art-a92c3381fce14c958a98ea82f9580d37
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-03-10T20:13:48Z
publishDate 2020-04-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-a92c3381fce14c958a98ea82f9580d372023-11-19T22:44:40ZengMDPI AGInformation2078-24892020-04-0111523610.3390/info11050236Exploring West African Folk Narrative Texts Using Machine LearningGossa Lô0Victor de Boer1Chris J. van Aart2Computer Science Department, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The NetherlandsComputer Science Department, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The NetherlandsBolesian B.V., Hooghiemstraplein 85, 3514 AX Utrecht, The NetherlandsThis paper examines how machine learning (ML) and natural language processing (NLP) can be used to identify, analyze, and generate West African folk tales. Two corpora of West African and Western European folk tales are compiled and used in three experiments on cross-cultural folk tale analysis. In the text generation experiment, two types of deep learning text generators are built and trained on the West African corpus. We show that although the texts range between semantic and syntactic coherence, each of them contains West African features. The second experiment further examines the distinction between the West African and Western European folk tales by comparing the performance of an LSTM (acc. 0.79) with a BoW classifier (acc. 0.93), indicating that the two corpora can be clearly distinguished in terms of vocabulary. An interactive t-SNE visualization of a hybrid classifier (acc. 0.85) highlights the culture-specific words for both. The third experiment describes an ML analysis of narrative structures. Classifiers trained on parts of folk tales according to the three-act structure are quite capable of distinguishing these parts (acc. 0.78). Common n-grams extracted from these parts not only underline cross-cultural distinctions in narrative structures, but also show the overlap between verbal and written West African narratives.https://www.mdpi.com/2078-2489/11/5/236deep learningtext generationtext classificationstorytellingWest Africafolk tales
spellingShingle Gossa Lô
Victor de Boer
Chris J. van Aart
Exploring West African Folk Narrative Texts Using Machine Learning
Information
deep learning
text generation
text classification
storytelling
West Africa
folk tales
title Exploring West African Folk Narrative Texts Using Machine Learning
title_full Exploring West African Folk Narrative Texts Using Machine Learning
title_fullStr Exploring West African Folk Narrative Texts Using Machine Learning
title_full_unstemmed Exploring West African Folk Narrative Texts Using Machine Learning
title_short Exploring West African Folk Narrative Texts Using Machine Learning
title_sort exploring west african folk narrative texts using machine learning
topic deep learning
text generation
text classification
storytelling
West Africa
folk tales
url https://www.mdpi.com/2078-2489/11/5/236
work_keys_str_mv AT gossalo exploringwestafricanfolknarrativetextsusingmachinelearning
AT victordeboer exploringwestafricanfolknarrativetextsusingmachinelearning
AT chrisjvanaart exploringwestafricanfolknarrativetextsusingmachinelearning