Transformer based Answer-Aware Bengali Question Generation

Question generation (QG), the task of generating questions from text or other forms of data, a significant and challenging subject, has recently attracted more attention in natural language processing (NLP) due to its vast range of business, healthcare, and education applications through creating qu...

Full description

Bibliographic Details
Main Authors: Jannatul Ferdous Ruma, Tasmiah Tahsin Mayeesha, Rashedur M. Rahman
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2023-06-01
Series:International Journal of Cognitive Computing in Engineering
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666307423000311
_version_ 1797377840535044096
author Jannatul Ferdous Ruma
Tasmiah Tahsin Mayeesha
Rashedur M. Rahman
author_facet Jannatul Ferdous Ruma
Tasmiah Tahsin Mayeesha
Rashedur M. Rahman
author_sort Jannatul Ferdous Ruma
collection DOAJ
description Question generation (QG), the task of generating questions from text or other forms of data, a significant and challenging subject, has recently attracted more attention in natural language processing (NLP) due to its vast range of business, healthcare, and education applications through creating quizzes, Frequently Asked Questions (FAQs) and documentation. Most QG research has been conducted in languages with abundant resources, such as English. However, due to the dearth of training data in low-resource languages, such as Bengali, thorough research on Bengali question generation has yet to be conducted. In this article, we propose a system for producing varied and pertinent Bengali questions from context passages in natural language in an answer-aware input format using a series of fine-tuned text-to-text transformer (T5) based models. During our studies with various transformer-based encoder-decoder models and various decoding processes, along with delivering 98% grammatically accurate questions, our fine-tuned BanglaT5 model had the highest 35.77 F-score in RougeL and 38.57 BLEU-1 score with beam search. Our automated and human evaluation results show that our answer-aware QG models can create realistic, human-like questions relevant to the context passage and answer. We also release our code, generated questions, dataset, and models to enable broader question generation research for the Bengali-speaking community.
first_indexed 2024-03-08T19:58:09Z
format Article
id doaj.art-6ac0a7bdff784c67988fd60e27214247
institution Directory Open Access Journal
issn 2666-3074
language English
last_indexed 2024-03-08T19:58:09Z
publishDate 2023-06-01
publisher KeAi Communications Co., Ltd.
record_format Article
series International Journal of Cognitive Computing in Engineering
spelling doaj.art-6ac0a7bdff784c67988fd60e272142472023-12-24T04:46:43ZengKeAi Communications Co., Ltd.International Journal of Cognitive Computing in Engineering2666-30742023-06-014314326Transformer based Answer-Aware Bengali Question GenerationJannatul Ferdous Ruma0Tasmiah Tahsin Mayeesha1Rashedur M. Rahman2Department of Electrical & Computer Engineering, North South University, Dhaka 1229, BangladeshDepartment of Electrical & Computer Engineering, North South University, Dhaka 1229, BangladeshCorresponding author.; Department of Electrical & Computer Engineering, North South University, Dhaka 1229, BangladeshQuestion generation (QG), the task of generating questions from text or other forms of data, a significant and challenging subject, has recently attracted more attention in natural language processing (NLP) due to its vast range of business, healthcare, and education applications through creating quizzes, Frequently Asked Questions (FAQs) and documentation. Most QG research has been conducted in languages with abundant resources, such as English. However, due to the dearth of training data in low-resource languages, such as Bengali, thorough research on Bengali question generation has yet to be conducted. In this article, we propose a system for producing varied and pertinent Bengali questions from context passages in natural language in an answer-aware input format using a series of fine-tuned text-to-text transformer (T5) based models. During our studies with various transformer-based encoder-decoder models and various decoding processes, along with delivering 98% grammatically accurate questions, our fine-tuned BanglaT5 model had the highest 35.77 F-score in RougeL and 38.57 BLEU-1 score with beam search. Our automated and human evaluation results show that our answer-aware QG models can create realistic, human-like questions relevant to the context passage and answer. We also release our code, generated questions, dataset, and models to enable broader question generation research for the Bengali-speaking community.http://www.sciencedirect.com/science/article/pii/S2666307423000311Question GenerationNatural Language ProcessingText-to-text transformerBanglaT5
spellingShingle Jannatul Ferdous Ruma
Tasmiah Tahsin Mayeesha
Rashedur M. Rahman
Transformer based Answer-Aware Bengali Question Generation
International Journal of Cognitive Computing in Engineering
Question Generation
Natural Language Processing
Text-to-text transformer
BanglaT5
title Transformer based Answer-Aware Bengali Question Generation
title_full Transformer based Answer-Aware Bengali Question Generation
title_fullStr Transformer based Answer-Aware Bengali Question Generation
title_full_unstemmed Transformer based Answer-Aware Bengali Question Generation
title_short Transformer based Answer-Aware Bengali Question Generation
title_sort transformer based answer aware bengali question generation
topic Question Generation
Natural Language Processing
Text-to-text transformer
BanglaT5
url http://www.sciencedirect.com/science/article/pii/S2666307423000311
work_keys_str_mv AT jannatulferdousruma transformerbasedanswerawarebengaliquestiongeneration
AT tasmiahtahsinmayeesha transformerbasedanswerawarebengaliquestiongeneration
AT rashedurmrahman transformerbasedanswerawarebengaliquestiongeneration