SMFSOP: A semantic-based modelling framework for student outcome prediction

Over the past two decades, studying the various factors affecting student performance became essential. Knowing these factors assist in enhancing student’s performance, teaching practices and policy decisions. This research proposes a framework named “Semantic-based Modeling Framework for Student Ou...

Full description

Bibliographic Details
Main Authors: Yomna M.I. Hassan, Abeer Elkorany, Khaled Wassif
Format: Article
Language:English
Published: Elsevier 2023-09-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157823002823
_version_ 1797663889429626880
author Yomna M.I. Hassan
Abeer Elkorany
Khaled Wassif
author_facet Yomna M.I. Hassan
Abeer Elkorany
Khaled Wassif
author_sort Yomna M.I. Hassan
collection DOAJ
description Over the past two decades, studying the various factors affecting student performance became essential. Knowing these factors assist in enhancing student’s performance, teaching practices and policy decisions. This research proposes a framework named “Semantic-based Modeling Framework for Student Outcome Prediction” (SMFSOP), to automatically map students’ activities within their learning environment to a standardized behavioral model (Community of Inquiry model (CoI)). The generated student representation is utilized to cluster students and predict an outcome based on their cluster. The framework is divided into three phases: Data gathering and pre-processing, automated mapping, clustering and prediction. The automatic mapping uses semantic similarity between student attribute names/descriptions, and CoI model indicators. Path and BERT similarities were identified as the best performers compared to human annotators. K-means, DBSCAN, and Kernel K-means are used for the clustering step, followed by LassoCV for regression-based prediction, & K-nearest neighbors for classification-based prediction. In order to prove that the proposed framework is generally applicable, three real life datasets were used as a case study. Best-performing trials enhanced outcome prediction as follows: In StudentLife Dataset, Adjusted R2 is enhanced by 3% (95% to 98%), and MSE decreased by 2.375 % (0.126 to 0.031). In social network dataset, Adjusted R2 was enhanced by 17% (65% to 82%). The MSE decreased by 4.4% (0.164 to 0.12). For the “Open university learning Analytics dataset” (OULAD), accuracy is improved by 1.56%, F1-score enhanced by 0.014. Precision is enhanced by 3.1%.
first_indexed 2024-03-11T19:21:19Z
format Article
id doaj.art-9e303b5cfa0841df8b66def11ea127db
institution Directory Open Access Journal
issn 1319-1578
language English
last_indexed 2024-03-11T19:21:19Z
publishDate 2023-09-01
publisher Elsevier
record_format Article
series Journal of King Saud University: Computer and Information Sciences
spelling doaj.art-9e303b5cfa0841df8b66def11ea127db2023-10-07T04:34:11ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782023-09-01358101728SMFSOP: A semantic-based modelling framework for student outcome predictionYomna M.I. Hassan0Abeer Elkorany1Khaled Wassif2Faculty of Mathematics and Computer Sciences, Universities of Canada, New Capital, Cairo, Egypt; Faculty of Computers and Artificial Intelligence, Cairo University, Dokki, Cairo, Egypt; Corresponding author.Faculty of Computers and Artificial Intelligence, Cairo University, Dokki, Cairo, EgyptFaculty of Computers and Artificial Intelligence, Cairo University, Dokki, Cairo, EgyptOver the past two decades, studying the various factors affecting student performance became essential. Knowing these factors assist in enhancing student’s performance, teaching practices and policy decisions. This research proposes a framework named “Semantic-based Modeling Framework for Student Outcome Prediction” (SMFSOP), to automatically map students’ activities within their learning environment to a standardized behavioral model (Community of Inquiry model (CoI)). The generated student representation is utilized to cluster students and predict an outcome based on their cluster. The framework is divided into three phases: Data gathering and pre-processing, automated mapping, clustering and prediction. The automatic mapping uses semantic similarity between student attribute names/descriptions, and CoI model indicators. Path and BERT similarities were identified as the best performers compared to human annotators. K-means, DBSCAN, and Kernel K-means are used for the clustering step, followed by LassoCV for regression-based prediction, & K-nearest neighbors for classification-based prediction. In order to prove that the proposed framework is generally applicable, three real life datasets were used as a case study. Best-performing trials enhanced outcome prediction as follows: In StudentLife Dataset, Adjusted R2 is enhanced by 3% (95% to 98%), and MSE decreased by 2.375 % (0.126 to 0.031). In social network dataset, Adjusted R2 was enhanced by 17% (65% to 82%). The MSE decreased by 4.4% (0.164 to 0.12). For the “Open university learning Analytics dataset” (OULAD), accuracy is improved by 1.56%, F1-score enhanced by 0.014. Precision is enhanced by 3.1%.http://www.sciencedirect.com/science/article/pii/S131915782300282300001111
spellingShingle Yomna M.I. Hassan
Abeer Elkorany
Khaled Wassif
SMFSOP: A semantic-based modelling framework for student outcome prediction
Journal of King Saud University: Computer and Information Sciences
0000
1111
title SMFSOP: A semantic-based modelling framework for student outcome prediction
title_full SMFSOP: A semantic-based modelling framework for student outcome prediction
title_fullStr SMFSOP: A semantic-based modelling framework for student outcome prediction
title_full_unstemmed SMFSOP: A semantic-based modelling framework for student outcome prediction
title_short SMFSOP: A semantic-based modelling framework for student outcome prediction
title_sort smfsop a semantic based modelling framework for student outcome prediction
topic 0000
1111
url http://www.sciencedirect.com/science/article/pii/S1319157823002823
work_keys_str_mv AT yomnamihassan smfsopasemanticbasedmodellingframeworkforstudentoutcomeprediction
AT abeerelkorany smfsopasemanticbasedmodellingframeworkforstudentoutcomeprediction
AT khaledwassif smfsopasemanticbasedmodellingframeworkforstudentoutcomeprediction