Leveraging Linked Open Data to Automatically Answer Arabic Questions
The interchangeably connected Web technologies and the advancements that accompany the semantic web content's leaps, have raised many challenges in the results' retrieval process especially for the Arabic Language. This research targets an important, yet insufficiently precedent, area in u...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8915781/ |
_version_ | 1818427577847513088 |
---|---|
author | Mohammad Al-Smadi Islam Al-Dalabih Yaser Jararweh Patrick Juola |
author_facet | Mohammad Al-Smadi Islam Al-Dalabih Yaser Jararweh Patrick Juola |
author_sort | Mohammad Al-Smadi |
collection | DOAJ |
description | The interchangeably connected Web technologies and the advancements that accompany the semantic web content's leaps, have raised many challenges in the results' retrieval process especially for the Arabic Language. This research targets an important, yet insufficiently precedent, area in using Linked Open Data (LOD) for Automatic Question Answering systems in the Arabic Language. The significance of work presented, comes from its ability to overcome many challenges in querying Arabic content. Some of these challenges are: (a) bridging the gap between natural language and linked data by mapping users' queries to a standard semantic web query language such as SPARQL, (b) facilitating multilingual access to semantic data, and (c) maintaining the quality of data. Another challenging aspect was the lack of related work and publicly available resources for Arabic Question Answering Systems over Linked Data, despite the vastly growing Arabic corpus on the web. This paper presents a novel approach that targets Automatic Arabic Questions' Answering Systems whilst bypassing many featured challenges in the field. A hybrid approach that evaluates the effectiveness of using LOD to automatically answer Arabic questions is developed. The approach is developed to map users' questions in Modern Standard Arabic, to a standard query language for LOD (i.e. SPARQL) through: (i) extracting entities from questions and linking them over the web using Named-Entity Recognition and Disambiguation (NER/NED), and (ii) extracting properties among extracted named entities using a dependency parsing approach integrated with Wikidata ontology. To evaluate our proposed system, an Arabic questions dataset was created including: (a) Question body in Arabic language, (b) Question type, (c) SPARQL Query formulation, and (d) Question answer. Evaluation results are promising with a Precision of 84%, a Recall of 81.3%, and an F-Measure of 82.8%. |
first_indexed | 2024-12-14T14:47:57Z |
format | Article |
id | doaj.art-20c96fce10794d1587ba635721d2d96e |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-14T14:47:57Z |
publishDate | 2019-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-20c96fce10794d1587ba635721d2d96e2022-12-21T22:57:13ZengIEEEIEEE Access2169-35362019-01-01717712217713610.1109/ACCESS.2019.29562338915781Leveraging Linked Open Data to Automatically Answer Arabic QuestionsMohammad Al-Smadi0https://orcid.org/0000-0002-7808-6962Islam Al-Dalabih1https://orcid.org/0000-0003-1242-9528Yaser Jararweh2https://orcid.org/0000-0002-4403-3846Patrick Juola3https://orcid.org/0000-0003-2578-6233Computer Science Department, Jordan University of Science and Technology, Irbid, JordanComputer Science Department, Jordan University of Science and Technology, Irbid, JordanComputer Science Department, Jordan University of Science and Technology, Irbid, JordanMathematics and Computer Science Department, Duquesne University, Pittsburgh, PA, USAThe interchangeably connected Web technologies and the advancements that accompany the semantic web content's leaps, have raised many challenges in the results' retrieval process especially for the Arabic Language. This research targets an important, yet insufficiently precedent, area in using Linked Open Data (LOD) for Automatic Question Answering systems in the Arabic Language. The significance of work presented, comes from its ability to overcome many challenges in querying Arabic content. Some of these challenges are: (a) bridging the gap between natural language and linked data by mapping users' queries to a standard semantic web query language such as SPARQL, (b) facilitating multilingual access to semantic data, and (c) maintaining the quality of data. Another challenging aspect was the lack of related work and publicly available resources for Arabic Question Answering Systems over Linked Data, despite the vastly growing Arabic corpus on the web. This paper presents a novel approach that targets Automatic Arabic Questions' Answering Systems whilst bypassing many featured challenges in the field. A hybrid approach that evaluates the effectiveness of using LOD to automatically answer Arabic questions is developed. The approach is developed to map users' questions in Modern Standard Arabic, to a standard query language for LOD (i.e. SPARQL) through: (i) extracting entities from questions and linking them over the web using Named-Entity Recognition and Disambiguation (NER/NED), and (ii) extracting properties among extracted named entities using a dependency parsing approach integrated with Wikidata ontology. To evaluate our proposed system, an Arabic questions dataset was created including: (a) Question body in Arabic language, (b) Question type, (c) SPARQL Query formulation, and (d) Question answer. Evaluation results are promising with a Precision of 84%, a Recall of 81.3%, and an F-Measure of 82.8%.https://ieeexplore.ieee.org/document/8915781/Semantic webquestion answering systemsstructured datanatural language processingArabic language |
spellingShingle | Mohammad Al-Smadi Islam Al-Dalabih Yaser Jararweh Patrick Juola Leveraging Linked Open Data to Automatically Answer Arabic Questions IEEE Access Semantic web question answering systems structured data natural language processing Arabic language |
title | Leveraging Linked Open Data to Automatically Answer Arabic Questions |
title_full | Leveraging Linked Open Data to Automatically Answer Arabic Questions |
title_fullStr | Leveraging Linked Open Data to Automatically Answer Arabic Questions |
title_full_unstemmed | Leveraging Linked Open Data to Automatically Answer Arabic Questions |
title_short | Leveraging Linked Open Data to Automatically Answer Arabic Questions |
title_sort | leveraging linked open data to automatically answer arabic questions |
topic | Semantic web question answering systems structured data natural language processing Arabic language |
url | https://ieeexplore.ieee.org/document/8915781/ |
work_keys_str_mv | AT mohammadalsmadi leveraginglinkedopendatatoautomaticallyanswerarabicquestions AT islamaldalabih leveraginglinkedopendatatoautomaticallyanswerarabicquestions AT yaserjararweh leveraginglinkedopendatatoautomaticallyanswerarabicquestions AT patrickjuola leveraginglinkedopendatatoautomaticallyanswerarabicquestions |