MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification

This research proposes a deep learning method for classifying student behavior in classrooms that follow the professional learning community teaching approach. We collected data on five student activities: hand-raising, interacting, sitting, turning around, and writing. We used the sum of absolute d...

Full description

Bibliographic Details
Main Authors:	Somsawut Nindam, Seung-Hoon Na, Hyo Jong Lee
Format:	Article
Language:	English
Published:	MDPI AG 2023-12-01
Series:	Applied Sciences
Subjects:	lesson study professional learning community student behaviors classification video classification keyframe selection multi-feature fusion
Online Access:	https://www.mdpi.com/2076-3417/14/1/230

_version_	1797359116229804032
author	Somsawut Nindam Seung-Hoon Na Hyo Jong Lee
author_facet	Somsawut Nindam Seung-Hoon Na Hyo Jong Lee
author_sort	Somsawut Nindam
collection	DOAJ
description	This research proposes a deep learning method for classifying student behavior in classrooms that follow the professional learning community teaching approach. We collected data on five student activities: hand-raising, interacting, sitting, turning around, and writing. We used the sum of absolute differences (SAD) in the LUV color space to detect scene changes. The K-means algorithm was then applied to select keyframes using the computed SAD. Next, we extracted features using multiple pretrained deep learning models from the convolutional neural network family. The pretrained models considered were InceptionV3, ResNet50V2, VGG16, and EfficientNetB7. We leveraged feature fusion, incorporating optical flow features and data augmentation techniques, to increase the necessary spatial features of selected keyframes. Finally, we classified the students’ behavior using a deep sequence model based on the bidirectional long short-term memory network with an attention mechanism (BiLSTM-AT). The proposed method with the BiLSTM-AT model can recognize behaviors from our dataset with high accuracy, precision, recall, and F1-scores of 0.97, 0.97, and 0.97, respectively. The overall accuracy was 96.67%. This high efficiency demonstrates the potential of the proposed method for classifying student behavior in classrooms.
first_indexed	2024-03-08T15:11:23Z
format	Article
id	doaj.art-132ff405e8534685bad02003b76dffc4
institution	Directory Open Access Journal
issn	2076-3417
language	English
last_indexed	2024-03-08T15:11:23Z
publishDate	2023-12-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj.art-132ff405e8534685bad02003b76dffc42024-01-10T14:51:25ZengMDPI AGApplied Sciences2076-34172023-12-0114123010.3390/app14010230MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior ClassificationSomsawut Nindam0Seung-Hoon Na1Hyo Jong Lee2Division of Computer Science and Engineering, CAIIT, Jeonbuk National University, Jeonju 54896, Republic of KoreaDivision of Computer Science and Engineering, CAIIT, Jeonbuk National University, Jeonju 54896, Republic of KoreaDivision of Computer Science and Engineering, CAIIT, Jeonbuk National University, Jeonju 54896, Republic of KoreaThis research proposes a deep learning method for classifying student behavior in classrooms that follow the professional learning community teaching approach. We collected data on five student activities: hand-raising, interacting, sitting, turning around, and writing. We used the sum of absolute differences (SAD) in the LUV color space to detect scene changes. The K-means algorithm was then applied to select keyframes using the computed SAD. Next, we extracted features using multiple pretrained deep learning models from the convolutional neural network family. The pretrained models considered were InceptionV3, ResNet50V2, VGG16, and EfficientNetB7. We leveraged feature fusion, incorporating optical flow features and data augmentation techniques, to increase the necessary spatial features of selected keyframes. Finally, we classified the students’ behavior using a deep sequence model based on the bidirectional long short-term memory network with an attention mechanism (BiLSTM-AT). The proposed method with the BiLSTM-AT model can recognize behaviors from our dataset with high accuracy, precision, recall, and F1-scores of 0.97, 0.97, and 0.97, respectively. The overall accuracy was 96.67%. This high efficiency demonstrates the potential of the proposed method for classifying student behavior in classrooms.https://www.mdpi.com/2076-3417/14/1/230lesson studyprofessional learning communitystudent behaviors classificationvideo classificationkeyframe selectionmulti-feature fusion
spellingShingle	Somsawut Nindam Seung-Hoon Na Hyo Jong Lee MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification Applied Sciences lesson study professional learning community student behaviors classification video classification keyframe selection multi-feature fusion
title	MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_full	MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_fullStr	MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_full_unstemmed	MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_short	MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_sort	multifusednet a multi feature fused network of pretrained vision models via keyframes for student behavior classification
topic	lesson study professional learning community student behaviors classification video classification keyframe selection multi-feature fusion
url	https://www.mdpi.com/2076-3417/14/1/230
work_keys_str_mv	AT somsawutnindam multifusednetamultifeaturefusednetworkofpretrainedvisionmodelsviakeyframesforstudentbehaviorclassification AT seunghoonna multifusednetamultifeaturefusednetworkofpretrainedvisionmodelsviakeyframesforstudentbehaviorclassification AT hyojonglee multifusednetamultifeaturefusednetworkofpretrainedvisionmodelsviakeyframesforstudentbehaviorclassification

MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification

Similar Items