Linguistic-based SPARQL translation model for semantic question answering system

Semantic Question Answering (SQA) aims to translate natural language (NL) questions to Simple Protocol and RDF Query Language (SPARQL) queries to retrieve answer from linked data. SQA deals with the complexity of NL questions because of the users’ styles of writing. Furthermore, the process to const...

Full description

Bibliographic Details
Main Authors: Sofian, Hazrina, Mohd Sharef, Nurfadhlina, Ibrahim, Hamidah, Azmi Murad, Masrah Azrifah, Mohd Noah, Shahrul Azman
Format: Article
Language:English
Published: American Scientific Publishers 2018
Online Access:http://psasir.upm.edu.my/id/eprint/64661/1/Linguistic-based%20SPARQL%20translation%20model%20for%20semantic%20question%20answering%20system.pdf
_version_ 1825932793746554880
author Sofian, Hazrina
Mohd Sharef, Nurfadhlina
Ibrahim, Hamidah
Azmi Murad, Masrah Azrifah
Mohd Noah, Shahrul Azman
author_facet Sofian, Hazrina
Mohd Sharef, Nurfadhlina
Ibrahim, Hamidah
Azmi Murad, Masrah Azrifah
Mohd Noah, Shahrul Azman
author_sort Sofian, Hazrina
collection UPM
description Semantic Question Answering (SQA) aims to translate natural language (NL) questions to Simple Protocol and RDF Query Language (SPARQL) queries to retrieve answer from linked data. SQA deals with the complexity of NL questions because of the users’ styles of writing. Furthermore, the process to construct the SPARQL query to retrieve answer from linked data is complex due to the different merging scenarios depending on the six meta-mapping aspects: (1) the question type; (2) the sequence of important POS tags; (3) the preposition occurrence (4) the datatype of the matched RDF triples; (5) the resource heterogeinity; (6) the structure of the matched RDF triples. To date, most existing researchers on SQA system have treated the focus for SQA system to accept complex NL question separately from the focus to address meta-mapping scenarios. The motivation of this study is to design and develop an SQA system that accepts complex NL questions while addressing the meta-mapping scenarios. This is vital because each user has their own idiosyncrasy in composing NL question which needs to be translated to SPARQL query that involve different merging meta-mapping scenarios. We designed the selective POS tag extraction technique and the semantic representation composition technique to handle the complex NL questions. Meanwhile, we formulated a new linguistic-based SPARQL translation model to address the meta-mapping scenarios. The model is formulated using our proposed QALD dataset analysis methodology which can also be used by other researchers to implement on any QALD dataset. Model-Driven Semantic Question Answering (MDSQA) system that is integrated with the two techniques and formulated model is developed to automate the translation of the NL questions to SPARQL queries. MDSQA is evaluated using the QALD-3 test dataset that consists of 100 NL questions as input. The output of the MDSQA are the constructed SPARQL queries. The evaluation results are derived by comparing the constructed SPARQL queries against the actual SPARQL queries provided by the QALD-3 test dataset. MDSQA is able to process all complex NL questions in QALD-3 which consist of simple and complex NL questions without any manual modification of the question. Based on precision and recall of answer type, SPARQL query form, number of triples, placement of triples and SPARQL condition, MDSQA is capable of addressing meta-mapping scenario. Further enhancement is needed to address the drawbacks of this approach.
first_indexed 2024-03-06T09:47:26Z
format Article
id upm.eprints-64661
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T09:47:26Z
publishDate 2018
publisher American Scientific Publishers
record_format dspace
spelling upm.eprints-646612018-08-13T03:45:43Z http://psasir.upm.edu.my/id/eprint/64661/ Linguistic-based SPARQL translation model for semantic question answering system Sofian, Hazrina Mohd Sharef, Nurfadhlina Ibrahim, Hamidah Azmi Murad, Masrah Azrifah Mohd Noah, Shahrul Azman Semantic Question Answering (SQA) aims to translate natural language (NL) questions to Simple Protocol and RDF Query Language (SPARQL) queries to retrieve answer from linked data. SQA deals with the complexity of NL questions because of the users’ styles of writing. Furthermore, the process to construct the SPARQL query to retrieve answer from linked data is complex due to the different merging scenarios depending on the six meta-mapping aspects: (1) the question type; (2) the sequence of important POS tags; (3) the preposition occurrence (4) the datatype of the matched RDF triples; (5) the resource heterogeinity; (6) the structure of the matched RDF triples. To date, most existing researchers on SQA system have treated the focus for SQA system to accept complex NL question separately from the focus to address meta-mapping scenarios. The motivation of this study is to design and develop an SQA system that accepts complex NL questions while addressing the meta-mapping scenarios. This is vital because each user has their own idiosyncrasy in composing NL question which needs to be translated to SPARQL query that involve different merging meta-mapping scenarios. We designed the selective POS tag extraction technique and the semantic representation composition technique to handle the complex NL questions. Meanwhile, we formulated a new linguistic-based SPARQL translation model to address the meta-mapping scenarios. The model is formulated using our proposed QALD dataset analysis methodology which can also be used by other researchers to implement on any QALD dataset. Model-Driven Semantic Question Answering (MDSQA) system that is integrated with the two techniques and formulated model is developed to automate the translation of the NL questions to SPARQL queries. MDSQA is evaluated using the QALD-3 test dataset that consists of 100 NL questions as input. The output of the MDSQA are the constructed SPARQL queries. The evaluation results are derived by comparing the constructed SPARQL queries against the actual SPARQL queries provided by the QALD-3 test dataset. MDSQA is able to process all complex NL questions in QALD-3 which consist of simple and complex NL questions without any manual modification of the question. Based on precision and recall of answer type, SPARQL query form, number of triples, placement of triples and SPARQL condition, MDSQA is capable of addressing meta-mapping scenario. Further enhancement is needed to address the drawbacks of this approach. American Scientific Publishers 2018 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/64661/1/Linguistic-based%20SPARQL%20translation%20model%20for%20semantic%20question%20answering%20system.pdf Sofian, Hazrina and Mohd Sharef, Nurfadhlina and Ibrahim, Hamidah and Azmi Murad, Masrah Azrifah and Mohd Noah, Shahrul Azman (2018) Linguistic-based SPARQL translation model for semantic question answering system. Advanced Science Letters, 24 (2). pp. 1375-1381. ISSN 1936-6612; ESSN: 1936-7317 https://www.ingentaconnect.com/contentone/asp/asl/2018/00000024/00000002/art00127 10.1166/asl.2018.10753
spellingShingle Sofian, Hazrina
Mohd Sharef, Nurfadhlina
Ibrahim, Hamidah
Azmi Murad, Masrah Azrifah
Mohd Noah, Shahrul Azman
Linguistic-based SPARQL translation model for semantic question answering system
title Linguistic-based SPARQL translation model for semantic question answering system
title_full Linguistic-based SPARQL translation model for semantic question answering system
title_fullStr Linguistic-based SPARQL translation model for semantic question answering system
title_full_unstemmed Linguistic-based SPARQL translation model for semantic question answering system
title_short Linguistic-based SPARQL translation model for semantic question answering system
title_sort linguistic based sparql translation model for semantic question answering system
url http://psasir.upm.edu.my/id/eprint/64661/1/Linguistic-based%20SPARQL%20translation%20model%20for%20semantic%20question%20answering%20system.pdf
work_keys_str_mv AT sofianhazrina linguisticbasedsparqltranslationmodelforsemanticquestionansweringsystem
AT mohdsharefnurfadhlina linguisticbasedsparqltranslationmodelforsemanticquestionansweringsystem
AT ibrahimhamidah linguisticbasedsparqltranslationmodelforsemanticquestionansweringsystem
AT azmimuradmasrahazrifah linguisticbasedsparqltranslationmodelforsemanticquestionansweringsystem
AT mohdnoahshahrulazman linguisticbasedsparqltranslationmodelforsemanticquestionansweringsystem