Assignments as Influential Factor to Improve the Prediction of Student Performance in Online Courses

Studies on the prediction of student success in distance learning have explored mainly demographics factors and student interactions with the virtual learning environments. However, it is remarkable that a very limited number of studies use information about the assignments submitted by students as...

Full description

Bibliographic Details
Main Authors: Aurora Esteban, Cristóbal Romero, Amelia Zafra
Format: Article
Language:English
Published: MDPI AG 2021-10-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/21/10145
_version_ 1797512772480663552
author Aurora Esteban
Cristóbal Romero
Amelia Zafra
author_facet Aurora Esteban
Cristóbal Romero
Amelia Zafra
author_sort Aurora Esteban
collection DOAJ
description Studies on the prediction of student success in distance learning have explored mainly demographics factors and student interactions with the virtual learning environments. However, it is remarkable that a very limited number of studies use information about the assignments submitted by students as influential factor to predict their academic achievement. This paper aims to explore the real importance of assignment information for solving students’ performance prediction in distance learning and evaluate the beneficial effect of including this information. We investigate and compare this factor and its potential from two information representation approaches: the traditional representation based on single instances and a more flexible representation based on Multiple Instance Learning (MIL), focus on handle weakly labeled data. A comparative study is carried out using the Open University Learning Analytics dataset, one of the most important public datasets in education provided by one of the greatest online universities of United Kingdom. The study includes a wide set of different types of machine learning algorithms addressed from the two data representation commented, showing that algorithms using only information about assignments with a representation based on MIL can outperform more than 20% the accuracy with respect to a representation based on single instance learning. Thus, it is concluded that applying an appropriate representation that eliminates the sparseness of data allows to show the relevance of a factor, such as the assignments submitted, not widely used to date to predict students’ academic performance. Moreover, a comparison with previous works on the same dataset and problem shows that predictive models based on MIL using only assignments information obtain competitive results compared to previous studies that include other factors to predict students performance.
first_indexed 2024-03-10T06:06:25Z
format Article
id doaj.art-489849b8fbb1446c8d11a2663dea2cfb
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T06:06:25Z
publishDate 2021-10-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-489849b8fbb1446c8d11a2663dea2cfb2023-11-22T20:28:27ZengMDPI AGApplied Sciences2076-34172021-10-0111211014510.3390/app112110145Assignments as Influential Factor to Improve the Prediction of Student Performance in Online CoursesAurora Esteban0Cristóbal Romero1Amelia Zafra2Department of Computer Science and Numerical Analysis, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Cordoba, 14071 Cordoba, SpainDepartment of Computer Science and Numerical Analysis, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Cordoba, 14071 Cordoba, SpainDepartment of Computer Science and Numerical Analysis, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Cordoba, 14071 Cordoba, SpainStudies on the prediction of student success in distance learning have explored mainly demographics factors and student interactions with the virtual learning environments. However, it is remarkable that a very limited number of studies use information about the assignments submitted by students as influential factor to predict their academic achievement. This paper aims to explore the real importance of assignment information for solving students’ performance prediction in distance learning and evaluate the beneficial effect of including this information. We investigate and compare this factor and its potential from two information representation approaches: the traditional representation based on single instances and a more flexible representation based on Multiple Instance Learning (MIL), focus on handle weakly labeled data. A comparative study is carried out using the Open University Learning Analytics dataset, one of the most important public datasets in education provided by one of the greatest online universities of United Kingdom. The study includes a wide set of different types of machine learning algorithms addressed from the two data representation commented, showing that algorithms using only information about assignments with a representation based on MIL can outperform more than 20% the accuracy with respect to a representation based on single instance learning. Thus, it is concluded that applying an appropriate representation that eliminates the sparseness of data allows to show the relevance of a factor, such as the assignments submitted, not widely used to date to predict students’ academic performance. Moreover, a comparison with previous works on the same dataset and problem shows that predictive models based on MIL using only assignments information obtain competitive results compared to previous studies that include other factors to predict students performance.https://www.mdpi.com/2076-3417/11/21/10145Multiple Instance Learningeducational data miningOULADvirtual learning systempredicting performance
spellingShingle Aurora Esteban
Cristóbal Romero
Amelia Zafra
Assignments as Influential Factor to Improve the Prediction of Student Performance in Online Courses
Applied Sciences
Multiple Instance Learning
educational data mining
OULAD
virtual learning system
predicting performance
title Assignments as Influential Factor to Improve the Prediction of Student Performance in Online Courses
title_full Assignments as Influential Factor to Improve the Prediction of Student Performance in Online Courses
title_fullStr Assignments as Influential Factor to Improve the Prediction of Student Performance in Online Courses
title_full_unstemmed Assignments as Influential Factor to Improve the Prediction of Student Performance in Online Courses
title_short Assignments as Influential Factor to Improve the Prediction of Student Performance in Online Courses
title_sort assignments as influential factor to improve the prediction of student performance in online courses
topic Multiple Instance Learning
educational data mining
OULAD
virtual learning system
predicting performance
url https://www.mdpi.com/2076-3417/11/21/10145
work_keys_str_mv AT auroraesteban assignmentsasinfluentialfactortoimprovethepredictionofstudentperformanceinonlinecourses
AT cristobalromero assignmentsasinfluentialfactortoimprovethepredictionofstudentperformanceinonlinecourses
AT ameliazafra assignmentsasinfluentialfactortoimprovethepredictionofstudentperformanceinonlinecourses