Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior Prediction

Facing the problem of massive unlabeled data and limited labeled samples, semi-supervised learning is favored, especially co-training. Standard co-training requires sufficiently redundant and conditionally independent dual views; however, in fact, few dual views exist that satisfy this condition. To...

Full description

Bibliographic Details
Main Authors:	Huanhuan Wang, Libo Xu, Zhenrui Huang, Jiagong Wang
Format:	Article
Language:	English
Published:	MDPI AG 2022-05-01
Series:	Axioms
Subjects:	semi-supervised co-training semi-decoupling feature importance Pearson correlation coefficient
Online Access:	https://www.mdpi.com/2075-1680/11/5/223

_version_	1797501606667747328
author	Huanhuan Wang Libo Xu Zhenrui Huang Jiagong Wang
author_facet	Huanhuan Wang Libo Xu Zhenrui Huang Jiagong Wang
author_sort	Huanhuan Wang
collection	DOAJ
description	Facing the problem of massive unlabeled data and limited labeled samples, semi-supervised learning is favored, especially co-training. Standard co-training requires sufficiently redundant and conditionally independent dual views; however, in fact, few dual views exist that satisfy this condition. To solve this problem, we propose a co-training method based on semi-decoupling features, that is, semi-decoupling features based on a known single view and then constructing independent and redundant dual views: (1) take a small number of important features as shared features of the dual views according to the importance of the features; (2) separate the remaining features one by one or in small batches according to the correlation between the features to make “divergent” features of the dual views; (3) combine the shared features and the “divergent” features to construct dual views. In this paper, the experimental dataset was from the edX dataset jointly released by Harvard University and MIT; the evaluation metrics adopted <i>F</i>1, <i>Precision</i>, and <i>Recall</i>. The analysis methods included three experiments: multiple models, iterations, and hyperparameters. The experimental results show that the effect of this model on MOOC learner behavior prediction was better than the other models, and the best prediction result was obtained in iteration 2. These all verify the effectiveness and superiority of this algorithm and provide a scientific and feasible reference for the development of the future education industry.
first_indexed	2024-03-10T03:20:57Z
format	Article
id	doaj.art-e3939cbcc1f54d48b66748e06ae1bcfd
institution	Directory Open Access Journal
issn	2075-1680
language	English
last_indexed	2024-03-10T03:20:57Z
publishDate	2022-05-01
publisher	MDPI AG
record_format	Article
series	Axioms
spelling	doaj.art-e3939cbcc1f54d48b66748e06ae1bcfd2023-11-23T10:04:18ZengMDPI AGAxioms2075-16802022-05-0111522310.3390/axioms11050223Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior PredictionHuanhuan Wang0Libo Xu1Zhenrui Huang2Jiagong Wang3School of Computing and Data Engineering, NingboTech University, Ningbo 315000, ChinaSchool of Computing and Data Engineering, NingboTech University, Ningbo 315000, ChinaSchool of Computing and Data Engineering, NingboTech University, Ningbo 315000, ChinaSchool of Computing and Data Engineering, NingboTech University, Ningbo 315000, ChinaFacing the problem of massive unlabeled data and limited labeled samples, semi-supervised learning is favored, especially co-training. Standard co-training requires sufficiently redundant and conditionally independent dual views; however, in fact, few dual views exist that satisfy this condition. To solve this problem, we propose a co-training method based on semi-decoupling features, that is, semi-decoupling features based on a known single view and then constructing independent and redundant dual views: (1) take a small number of important features as shared features of the dual views according to the importance of the features; (2) separate the remaining features one by one or in small batches according to the correlation between the features to make “divergent” features of the dual views; (3) combine the shared features and the “divergent” features to construct dual views. In this paper, the experimental dataset was from the edX dataset jointly released by Harvard University and MIT; the evaluation metrics adopted <i>F</i>1, <i>Precision</i>, and <i>Recall</i>. The analysis methods included three experiments: multiple models, iterations, and hyperparameters. The experimental results show that the effect of this model on MOOC learner behavior prediction was better than the other models, and the best prediction result was obtained in iteration 2. These all verify the effectiveness and superiority of this algorithm and provide a scientific and feasible reference for the development of the future education industry.https://www.mdpi.com/2075-1680/11/5/223semi-supervisedco-trainingsemi-decouplingfeature importancePearson correlation coefficient
spellingShingle	Huanhuan Wang Libo Xu Zhenrui Huang Jiagong Wang Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior Prediction Axioms semi-supervised co-training semi-decoupling feature importance Pearson correlation coefficient
title	Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior Prediction
title_full	Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior Prediction
title_fullStr	Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior Prediction
title_full_unstemmed	Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior Prediction
title_short	Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior Prediction
title_sort	co training method based on semi decoupling features for mooc learner behavior prediction
topic	semi-supervised co-training semi-decoupling feature importance Pearson correlation coefficient
url	https://www.mdpi.com/2075-1680/11/5/223
work_keys_str_mv	AT huanhuanwang cotrainingmethodbasedonsemidecouplingfeaturesformooclearnerbehaviorprediction AT liboxu cotrainingmethodbasedonsemidecouplingfeaturesformooclearnerbehaviorprediction AT zhenruihuang cotrainingmethodbasedonsemidecouplingfeaturesformooclearnerbehaviorprediction AT jiagongwang cotrainingmethodbasedonsemidecouplingfeaturesformooclearnerbehaviorprediction

Co-Training Method Based on Semi-Decoupling Features for MOOC Learner Behavior Prediction

Similar Items