Predicting the final grade using a machine learning regression model: insights from fifty percent of total course grades in CS1 courses

This article introduces a model for accurately predicting students’ final grades in the CS1 course by utilizing their grades from the first half of the course. The methodology includes three phases: training, testing, and validation, employing four regression algorithms: AdaBoost, Random Forest, Sup...

Full description

Bibliographic Details
Main Authors: Carlos Giovanny Hidalgo Suarez, Jose Llanos, Víctor A. Bucheli
Format: Article
Language:English
Published: PeerJ Inc. 2023-12-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-1689.pdf
Description
Summary:This article introduces a model for accurately predicting students’ final grades in the CS1 course by utilizing their grades from the first half of the course. The methodology includes three phases: training, testing, and validation, employing four regression algorithms: AdaBoost, Random Forest, Support Vector Regression (SVR), and XGBoost. Notably, the SVR algorithm outperformed the others, achieving an impressive R-squared (R2) value ranging from 72% to 91%. The discussion section focuses on four crucial aspects: the selection of data features and the percentage of course grades used for training, the comparison between predicted and actual values to demonstrate reliability, and the model’s performance compared to existing literature models, highlighting its effectiveness.
ISSN:2376-5992