Arabic Narrative Question Answering (QA) Using Transformer Models

Bibliographic Details
Main Authors: Mohammad A. Ateeq, Sabrina Tiun, Hamed Abdelhaq, Nawras Rahhal
Format: Article
Language: English
Published: IEEE 2024-01-01
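Series: IEEE Access
Subjects: Arabic question answering; answer generation; cross-lingual transfer learning; reading comprehension; narrative QA
Online Access: https://ieeexplore.ieee.org/document/10376168/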
author Mohammad A. Ateeq
Sabrina Tiun
Hamed Abdelhaq
Nawras Rahhal
author_sort Mohammad A. Ateeq
collection DOAJ
description The Narrative question answering (QA) problem involves generating accurate, relevant, and human-like answers to questions based on the comprehension of a story consisting of logically connected paragraphs. Developing Narrative QA models allows students to ask about inconspicuous narrative elements while reading the story. However, this problem remains unexplored for the Arabic language because of the lack of Arabic narrative datasets. To address this gap, we present the Arabic-NarrativeQA dataset, which is the first dataset specifically designed for machine-reading comprehension of Arabic stories. This dataset consists of two parts: translation of an English NarrativeQA dataset and a collection of new question-answer pairs based on Arabic stories. Furthermore, we implement the Arabic-NarrativeQA system using the Ranker-Reader pipeline, exploring and evaluating various approaches at each stage to identify the most effective ones. To avoid the need for an extensive data collection process, we utilize cross-lingual transfer learning techniques to leverage knowledge transfer from the English Narrative QA dataset to the Arabic-NarrativeQA system. Experiments show that incorporating cross-lingual transfer learning significantly improved the performance of the reader models. Furthermore, the question’s evidence information provided in the Arabic-NarrativeQA dataset enables the learnable rankers to effectively identify and select the pertinent paragraphs. Finally, we examine and categorize challenging questions that require a deep understanding of the stories. By incorporating these question types into the introduced dataset, we show that existing reading comprehension models struggle to answer them, and further model development should be conducted. To promote further research on this task, we make both the Arabic-NarrativeQA dataset and the pre-trained models publicly available.
first_indexed 2024-03-08T15:55:04Z
format Article
id doaj.art-e535757f50fe43f784ea904beaec0dcf
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-08T15:55:04Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-e535757f50fe43f784ea904beaec0dcf
2024-01-09T00:04:13Z
eng
IEEE
IEEE Access
2169-3536
2024-01-01
12
2760
2777
10.1109/ACCESS.2023.3348410
10376168
Arabic Narrative Question Answering (QA) Using Transformer Models
Mohammad A. Ateeq (0) https://orcid.org/0000-0003-0296-5562
Sabrina Tiun (1) https://orcid.org/0000-0002-1134-973X
Hamed Abdelhaq (2) https://orcid.org/0000-0003-4803-6689
Nawras Rahhal (3) https://orcid.org/0009-0009-4599-7135
Faculty of Information Science and Technology, Centre for Artificial Intelligence Technology, Universiti Kebangsaan Malaysia, Bangi, Malaysia
Faculty of Information Science and Technology, Centre for Artificial Intelligence Technology, Universiti Kebangsaan Malaysia, Bangi, Malaysia
Department of Computer Science, An-Najah National University, Nablus, Palestine
Department of Computer Science, An-Najah National University, Nablus, Palestine
https://ieeexplore.ieee.org/document/10376168/
Arabic question answering
answer generation
cross-lingual transfer learning
reading comprehension
narrative QA
title Arabic Narrative Question Answering (QA) Using Transformer Models
topic Arabic question answering
answer generation
cross-lingual transfer learning
reading comprehension
narrative QA
url https://ieeexplore.ieee.org/document/10376168/