Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages
In this paper, a novel approach to automatic question generation (AQG) using semantic role labeling (SRL) for morphologically rich languages is presented. A model for AQG is developed for our native speaking language, Croatian. Croatian language is a highly inflected language that belongs to Balto-S...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek
2021-01-01
|
Series: | Tehnički Vjesnik |
Subjects: | |
Online Access: | https://hrcak.srce.hr/file/375247 |
_version_ | 1797207002063044608 |
---|---|
author | Daniel Vasić* Branko Žitko Hrvoje Ljubić |
author_facet | Daniel Vasić* Branko Žitko Hrvoje Ljubić |
author_sort | Daniel Vasić* |
collection | DOAJ |
description | In this paper, a novel approach to automatic question generation (AQG) using semantic role labeling (SRL) for morphologically rich languages is presented. A model for AQG is developed for our native speaking language, Croatian. Croatian language is a highly inflected language that belongs to Balto-Slavic family of languages. Globally this article can be divided into two stages. In the first stage we present a novel approach to SRL of texts written in Croatian language that uses Conditional Random Fields (CRF). SRL traditionally consists of predicate disambiguation, argument identification and argument classification. After these steps most approaches use beam search to find optimal sequence of arguments based on given predicate. We propose the architecture for predicate identification and argument classification in which finding the best sequence of arguments is handled by Viterbi decoding. We enrich SRL features with custom attributes that are custom made for this language. Our SRL system achieves F1 score of 78% in argument classification step on Croatian hr 500k corpus. In the second stage the proposed SRL model is used to develop AQG system for question generation from texts written in Croatian language. We proposed custom templates for AQG that were used to generate a total of 628 questions which were evaluated by experts scoring every question on a Likert scale. Expert evaluation of the system showed that our AQG achieved good results. The evaluation showed that 68% of the generated questions could be used for educational purposes. With these results the proposed AQG system could be used for possible implementation inside educational systems such as Intelligent Tutoring Systems. |
first_indexed | 2024-04-24T09:15:59Z |
format | Article |
id | doaj.art-640f359877164ed6bbdf8045266dac20 |
institution | Directory Open Access Journal |
issn | 1330-3651 1848-6339 |
language | English |
last_indexed | 2024-04-24T09:15:59Z |
publishDate | 2021-01-01 |
publisher | Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek |
record_format | Article |
series | Tehnički Vjesnik |
spelling | doaj.art-640f359877164ed6bbdf8045266dac202024-04-15T16:56:25ZengFaculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in OsijekTehnički Vjesnik1330-36511848-63392021-01-0128373974510.17559/TV-20200402175619Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich LanguagesDaniel Vasić*0Branko Žitko1Hrvoje Ljubić2University of Mostar, Faculty of Science and Education, 88000 Mostar, Bosnia and HerzegovinaUniversity of Split, Faculty of Science, 10000 Split, CroatiaUniversity of Mostar, Faculty of Science and Education, 88000 Mostar, Bosnia and HerzegovinaIn this paper, a novel approach to automatic question generation (AQG) using semantic role labeling (SRL) for morphologically rich languages is presented. A model for AQG is developed for our native speaking language, Croatian. Croatian language is a highly inflected language that belongs to Balto-Slavic family of languages. Globally this article can be divided into two stages. In the first stage we present a novel approach to SRL of texts written in Croatian language that uses Conditional Random Fields (CRF). SRL traditionally consists of predicate disambiguation, argument identification and argument classification. After these steps most approaches use beam search to find optimal sequence of arguments based on given predicate. We propose the architecture for predicate identification and argument classification in which finding the best sequence of arguments is handled by Viterbi decoding. We enrich SRL features with custom attributes that are custom made for this language. Our SRL system achieves F1 score of 78% in argument classification step on Croatian hr 500k corpus. In the second stage the proposed SRL model is used to develop AQG system for question generation from texts written in Croatian language. We proposed custom templates for AQG that were used to generate a total of 628 questions which were evaluated by experts scoring every question on a Likert scale. Expert evaluation of the system showed that our AQG achieved good results. The evaluation showed that 68% of the generated questions could be used for educational purposes. With these results the proposed AQG system could be used for possible implementation inside educational systems such as Intelligent Tutoring Systems.https://hrcak.srce.hr/file/375247automatic question generationmorphologically rich languagesnatural language processingsemantic role labeling |
spellingShingle | Daniel Vasić* Branko Žitko Hrvoje Ljubić Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages Tehnički Vjesnik automatic question generation morphologically rich languages natural language processing semantic role labeling |
title | Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages |
title_full | Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages |
title_fullStr | Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages |
title_full_unstemmed | Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages |
title_short | Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages |
title_sort | automatic question generation using semantic role labeling for morphologically rich languages |
topic | automatic question generation morphologically rich languages natural language processing semantic role labeling |
url | https://hrcak.srce.hr/file/375247 |
work_keys_str_mv | AT danielvasic automaticquestiongenerationusingsemanticrolelabelingformorphologicallyrichlanguages AT brankozitko automaticquestiongenerationusingsemanticrolelabelingformorphologicallyrichlanguages AT hrvojeljubic automaticquestiongenerationusingsemanticrolelabelingformorphologicallyrichlanguages |