Predictive model selection for completion rate in Massive Open Online Courses

In this paper we introduce an approach for selecting a linear model to predict the completion rate of massive open online courses (MOOCs). Data are derived from LMS analytics and nominal surveys. The sample comprises 723 observations (users) collected in seven courses on...

Bibliographic Details
Main Authors: Annamaria De Santis, Katia Sannicandro, Claudia Bellini, Tommaso Minerva
Format: Article
Language: English
Published: Italian e-Learning Association 2019-10-01
Series: Je-LKS: Journal of E-Learning and Knowledge Society
Subjects: Learning Analytics, MOOCs, Predictive Models, Course Completion
Online Access: http://je-lks.org/ojs/index.php/Je-LKS_EN/article/view/1135034
_version_ 1818470359569006592
author Annamaria De Santis
Katia Sannicandro
Claudia Bellini
Tommaso Minerva
collection DOAJ
description In this paper we introduce an approach for selecting a linear model to predict the completion rate of massive open online courses (MOOCs). Data are derived from LMS analytics and nominal surveys. The sample comprises 723 observations (users) collected in seven courses on EduOpen, the Italian MOOCs platform. We used 24 independent variables (predictors), categorised into four groups (User Profile, User Engagement, User Behaviour, Course Profile). As response variables we examined both the course completion status and the completion rate of the learning activities. A first analysis examined the correlations between predictors within each group and across groups, as well as those between all the predictors and the two response variables. The linear regression analysis used a stepwise approach to model selection based on the Akaike information criterion (AIC). For each response variable we estimated predictive models using the groups of predictors both separately and in combination. The models were validated using the usual statistical tests. The main results suggest that course completion and completion rate depend strongly on variables measuring the user’s behavioural profile in the course and only weakly on the user’s profile, motivation and course pattern. In addition, residual analysis indicates the potential occurrence of interaction effects among variables and non-linear dynamics.
first_indexed 2024-04-13T21:36:08Z
format Article
id doaj.art-7566dc9c531b45e1a6c3665f6408f08a
institution Directory Open Access Journal
issn 1826-6223
1971-8829
language English
last_indexed 2024-04-13T21:36:08Z
publishDate 2019-10-01
publisher Italian e-Learning Association
record_format Article
series Je-LKS: Journal of E-Learning and Knowledge Society
spelling doaj.art-7566dc9c531b45e1a6c3665f6408f08a
2022-12-22T02:28:57Z
eng
Italian e-Learning Association
Je-LKS: Journal of E-Learning and Knowledge Society
1826-6223
1971-8829
2019-10-01
15
3
10.20368/1971-8829/1135034
Predictive model selection for completion rate in Massive Open Online Courses
Annamaria De Santis (University of Modena and Reggio Emilia)
Katia Sannicandro (University of Modena and Reggio Emilia)
Claudia Bellini (University of Modena and Reggio Emilia)
Tommaso Minerva (University of Modena and Reggio Emilia)
http://je-lks.org/ojs/index.php/Je-LKS_EN/article/view/1135034
Learning Analytics
MOOCs
Predictive Models
Course Completion
title Predictive model selection for completion rate in Massive Open Online Courses
topic Learning Analytics
MOOCs
Predictive Models
Course Completion
url http://je-lks.org/ojs/index.php/Je-LKS_EN/article/view/1135034
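
The description field in this record names a stepwise approach to linear model selection using AIC. The following is a minimal Python sketch of one such procedure (forward selection only), not the authors' implementation. The data are synthetic, and the predictor names `activities_viewed`, `forum_posts` and `age`, as well as the helper functions, are hypothetical and introduced only for illustration.

```python
import math
import random


def ols_rss(columns, y):
    """Residual sum of squares of a least-squares fit of y on an intercept plus `columns`."""
    n = len(y)
    design = [[1.0] + [col[i] for col in columns] for i in range(n)]
    p = len(design[0])
    # Normal equations (X'X) b = X'y, stored as an augmented matrix.
    aug = [
        [sum(design[i][r] * design[i][c] for i in range(n)) for c in range(p)]
        + [sum(design[i][r] * y[i] for i in range(n))]
        for r in range(p)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for k in range(p):
        pivot = max(range(k, p), key=lambda r: abs(aug[r][k]))
        aug[k], aug[pivot] = aug[pivot], aug[k]
        for r in range(p):
            if r != k:
                factor = aug[r][k] / aug[k][k]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[k])]
    beta = [aug[k][p] / aug[k][k] for k in range(p)]
    return sum(
        (y[i] - sum(b * x for b, x in zip(beta, design[i]))) ** 2 for i in range(n)
    )


def aic(rss, n, n_params):
    """Gaussian AIC up to an additive constant: n * ln(RSS / n) + 2 * k."""
    return n * math.log(rss / n) + 2 * n_params


def forward_stepwise(predictors, y):
    """Add, one at a time, the predictor that lowers AIC most; stop when none does."""
    n = len(y)
    selected = []
    best = aic(ols_rss([], y), n, 1)  # intercept-only model
    while True:
        scored = []
        for name in predictors:
            if name in selected:
                continue
            cols = [predictors[s] for s in selected + [name]]
            scored.append((aic(ols_rss(cols, y), n, len(cols) + 1), name))
        if not scored:
            break
        score, name = min(scored)
        if score >= best:
            break
        best = score
        selected.append(name)
    return selected, best


# Synthetic data (an assumption for illustration): only `activities_viewed`
# actually drives the response.
random.seed(0)
n_users = 200
predictors = {
    "activities_viewed": [random.gauss(0, 1) for _ in range(n_users)],
    "forum_posts": [random.gauss(0, 1) for _ in range(n_users)],
    "age": [random.gauss(0, 1) for _ in range(n_users)],
}
y = [50 + 20 * a + random.gauss(0, 5) for a in predictors["activities_viewed"]]

selected, final_aic = forward_stepwise(predictors, y)
print(selected, round(final_aic, 2))
```

The sketch only adds predictors; a bidirectional stepwise search would also try dropping previously selected predictors at each step. Whether the article used forward, backward or bidirectional selection is not stated in this record.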