Leveraging Linked Open Data to Automatically Answer Arabic Questions

The interchangeably connected Web technologies and the advancements that accompany the semantic web content's leaps, have raised many challenges in the results' retrieval process especially for the Arabic Language. This research targets an important, yet insufficiently precedent, area in u...

Full description

Bibliographic Details
Main Authors: Mohammad Al-Smadi, Islam Al-Dalabih, Yaser Jararweh, Patrick Juola
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8915781/
_version_ 1818427577847513088
author Mohammad Al-Smadi
Islam Al-Dalabih
Yaser Jararweh
Patrick Juola
author_facet Mohammad Al-Smadi
Islam Al-Dalabih
Yaser Jararweh
Patrick Juola
author_sort Mohammad Al-Smadi
collection DOAJ
description The interchangeably connected Web technologies and the advancements that accompany the semantic web content's leaps, have raised many challenges in the results' retrieval process especially for the Arabic Language. This research targets an important, yet insufficiently precedent, area in using Linked Open Data (LOD) for Automatic Question Answering systems in the Arabic Language. The significance of work presented, comes from its ability to overcome many challenges in querying Arabic content. Some of these challenges are: (a) bridging the gap between natural language and linked data by mapping users' queries to a standard semantic web query language such as SPARQL, (b) facilitating multilingual access to semantic data, and (c) maintaining the quality of data. Another challenging aspect was the lack of related work and publicly available resources for Arabic Question Answering Systems over Linked Data, despite the vastly growing Arabic corpus on the web. This paper presents a novel approach that targets Automatic Arabic Questions' Answering Systems whilst bypassing many featured challenges in the field. A hybrid approach that evaluates the effectiveness of using LOD to automatically answer Arabic questions is developed. The approach is developed to map users' questions in Modern Standard Arabic, to a standard query language for LOD (i.e. SPARQL) through: (i) extracting entities from questions and linking them over the web using Named-Entity Recognition and Disambiguation (NER/NED), and (ii) extracting properties among extracted named entities using a dependency parsing approach integrated with Wikidata ontology. To evaluate our proposed system, an Arabic questions dataset was created including: (a) Question body in Arabic language, (b) Question type, (c) SPARQL Query formulation, and (d) Question answer. Evaluation results are promising with a Precision of 84%, a Recall of 81.3%, and an F-Measure of 82.8%.
first_indexed 2024-12-14T14:47:57Z
format Article
id doaj.art-20c96fce10794d1587ba635721d2d96e
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-14T14:47:57Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-20c96fce10794d1587ba635721d2d96e2022-12-21T22:57:13ZengIEEEIEEE Access2169-35362019-01-01717712217713610.1109/ACCESS.2019.29562338915781Leveraging Linked Open Data to Automatically Answer Arabic QuestionsMohammad Al-Smadi0https://orcid.org/0000-0002-7808-6962Islam Al-Dalabih1https://orcid.org/0000-0003-1242-9528Yaser Jararweh2https://orcid.org/0000-0002-4403-3846Patrick Juola3https://orcid.org/0000-0003-2578-6233Computer Science Department, Jordan University of Science and Technology, Irbid, JordanComputer Science Department, Jordan University of Science and Technology, Irbid, JordanComputer Science Department, Jordan University of Science and Technology, Irbid, JordanMathematics and Computer Science Department, Duquesne University, Pittsburgh, PA, USAThe interchangeably connected Web technologies and the advancements that accompany the semantic web content's leaps, have raised many challenges in the results' retrieval process especially for the Arabic Language. This research targets an important, yet insufficiently precedent, area in using Linked Open Data (LOD) for Automatic Question Answering systems in the Arabic Language. The significance of work presented, comes from its ability to overcome many challenges in querying Arabic content. Some of these challenges are: (a) bridging the gap between natural language and linked data by mapping users' queries to a standard semantic web query language such as SPARQL, (b) facilitating multilingual access to semantic data, and (c) maintaining the quality of data. Another challenging aspect was the lack of related work and publicly available resources for Arabic Question Answering Systems over Linked Data, despite the vastly growing Arabic corpus on the web. This paper presents a novel approach that targets Automatic Arabic Questions' Answering Systems whilst bypassing many featured challenges in the field. A hybrid approach that evaluates the effectiveness of using LOD to automatically answer Arabic questions is developed. The approach is developed to map users' questions in Modern Standard Arabic, to a standard query language for LOD (i.e. SPARQL) through: (i) extracting entities from questions and linking them over the web using Named-Entity Recognition and Disambiguation (NER/NED), and (ii) extracting properties among extracted named entities using a dependency parsing approach integrated with Wikidata ontology. To evaluate our proposed system, an Arabic questions dataset was created including: (a) Question body in Arabic language, (b) Question type, (c) SPARQL Query formulation, and (d) Question answer. Evaluation results are promising with a Precision of 84%, a Recall of 81.3%, and an F-Measure of 82.8%.https://ieeexplore.ieee.org/document/8915781/Semantic webquestion answering systemsstructured datanatural language processingArabic language
spellingShingle Mohammad Al-Smadi
Islam Al-Dalabih
Yaser Jararweh
Patrick Juola
Leveraging Linked Open Data to Automatically Answer Arabic Questions
IEEE Access
Semantic web
question answering systems
structured data
natural language processing
Arabic language
title Leveraging Linked Open Data to Automatically Answer Arabic Questions
title_full Leveraging Linked Open Data to Automatically Answer Arabic Questions
title_fullStr Leveraging Linked Open Data to Automatically Answer Arabic Questions
title_full_unstemmed Leveraging Linked Open Data to Automatically Answer Arabic Questions
title_short Leveraging Linked Open Data to Automatically Answer Arabic Questions
title_sort leveraging linked open data to automatically answer arabic questions
topic Semantic web
question answering systems
structured data
natural language processing
Arabic language
url https://ieeexplore.ieee.org/document/8915781/
work_keys_str_mv AT mohammadalsmadi leveraginglinkedopendatatoautomaticallyanswerarabicquestions
AT islamaldalabih leveraginglinkedopendatatoautomaticallyanswerarabicquestions
AT yaserjararweh leveraginglinkedopendatatoautomaticallyanswerarabicquestions
AT patrickjuola leveraginglinkedopendatatoautomaticallyanswerarabicquestions