Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages

In this paper, a novel approach to automatic question generation (AQG) using semantic role labeling (SRL) for morphologically rich languages is presented. A model for AQG is developed for our native speaking language, Croatian. Croatian language is a highly inflected language that belongs to Balto-S...

Full description

Bibliographic Details
Main Authors: Daniel Vasić*, Branko Žitko, Hrvoje Ljubić
Format: Article
Language:English
Published: Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek 2021-01-01
Series:Tehnički Vjesnik
Subjects:
Online Access:https://hrcak.srce.hr/file/375247
_version_ 1797207002063044608
author Daniel Vasić*
Branko Žitko
Hrvoje Ljubić
author_facet Daniel Vasić*
Branko Žitko
Hrvoje Ljubić
author_sort Daniel Vasić*
collection DOAJ
description In this paper, a novel approach to automatic question generation (AQG) using semantic role labeling (SRL) for morphologically rich languages is presented. A model for AQG is developed for our native speaking language, Croatian. Croatian language is a highly inflected language that belongs to Balto-Slavic family of languages. Globally this article can be divided into two stages. In the first stage we present a novel approach to SRL of texts written in Croatian language that uses Conditional Random Fields (CRF). SRL traditionally consists of predicate disambiguation, argument identification and argument classification. After these steps most approaches use beam search to find optimal sequence of arguments based on given predicate. We propose the architecture for predicate identification and argument classification in which finding the best sequence of arguments is handled by Viterbi decoding. We enrich SRL features with custom attributes that are custom made for this language. Our SRL system achieves F1 score of 78% in argument classification step on Croatian hr 500k corpus. In the second stage the proposed SRL model is used to develop AQG system for question generation from texts written in Croatian language. We proposed custom templates for AQG that were used to generate a total of 628 questions which were evaluated by experts scoring every question on a Likert scale. Expert evaluation of the system showed that our AQG achieved good results. The evaluation showed that 68% of the generated questions could be used for educational purposes. With these results the proposed AQG system could be used for possible implementation inside educational systems such as Intelligent Tutoring Systems.
first_indexed 2024-04-24T09:15:59Z
format Article
id doaj.art-640f359877164ed6bbdf8045266dac20
institution Directory Open Access Journal
issn 1330-3651
1848-6339
language English
last_indexed 2024-04-24T09:15:59Z
publishDate 2021-01-01
publisher Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek
record_format Article
series Tehnički Vjesnik
spelling doaj.art-640f359877164ed6bbdf8045266dac202024-04-15T16:56:25ZengFaculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in OsijekTehnički Vjesnik1330-36511848-63392021-01-0128373974510.17559/TV-20200402175619Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich LanguagesDaniel Vasić*0Branko Žitko1Hrvoje Ljubić2University of Mostar, Faculty of Science and Education, 88000 Mostar, Bosnia and HerzegovinaUniversity of Split, Faculty of Science, 10000 Split, CroatiaUniversity of Mostar, Faculty of Science and Education, 88000 Mostar, Bosnia and HerzegovinaIn this paper, a novel approach to automatic question generation (AQG) using semantic role labeling (SRL) for morphologically rich languages is presented. A model for AQG is developed for our native speaking language, Croatian. Croatian language is a highly inflected language that belongs to Balto-Slavic family of languages. Globally this article can be divided into two stages. In the first stage we present a novel approach to SRL of texts written in Croatian language that uses Conditional Random Fields (CRF). SRL traditionally consists of predicate disambiguation, argument identification and argument classification. After these steps most approaches use beam search to find optimal sequence of arguments based on given predicate. We propose the architecture for predicate identification and argument classification in which finding the best sequence of arguments is handled by Viterbi decoding. We enrich SRL features with custom attributes that are custom made for this language. Our SRL system achieves F1 score of 78% in argument classification step on Croatian hr 500k corpus. In the second stage the proposed SRL model is used to develop AQG system for question generation from texts written in Croatian language. We proposed custom templates for AQG that were used to generate a total of 628 questions which were evaluated by experts scoring every question on a Likert scale. Expert evaluation of the system showed that our AQG achieved good results. The evaluation showed that 68% of the generated questions could be used for educational purposes. With these results the proposed AQG system could be used for possible implementation inside educational systems such as Intelligent Tutoring Systems.https://hrcak.srce.hr/file/375247automatic question generationmorphologically rich languagesnatural language processingsemantic role labeling
spellingShingle Daniel Vasić*
Branko Žitko
Hrvoje Ljubić
Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages
Tehnički Vjesnik
automatic question generation
morphologically rich languages
natural language processing
semantic role labeling
title Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages
title_full Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages
title_fullStr Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages
title_full_unstemmed Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages
title_short Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages
title_sort automatic question generation using semantic role labeling for morphologically rich languages
topic automatic question generation
morphologically rich languages
natural language processing
semantic role labeling
url https://hrcak.srce.hr/file/375247
work_keys_str_mv AT danielvasic automaticquestiongenerationusingsemanticrolelabelingformorphologicallyrichlanguages
AT brankozitko automaticquestiongenerationusingsemanticrolelabelingformorphologicallyrichlanguages
AT hrvojeljubic automaticquestiongenerationusingsemanticrolelabelingformorphologicallyrichlanguages