MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification

This research proposes a deep learning method for classifying student behavior in classrooms that follow the professional learning community teaching approach. We collected data on five student activities: hand-raising, interacting, sitting, turning around, and writing. We used the sum of absolute d...

Full description

Bibliographic Details
Main Authors: Somsawut Nindam, Seung-Hoon Na, Hyo Jong Lee
Format: Article
Language:English
Published: MDPI AG 2023-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/1/230
_version_ 1797359116229804032
author Somsawut Nindam
Seung-Hoon Na
Hyo Jong Lee
author_facet Somsawut Nindam
Seung-Hoon Na
Hyo Jong Lee
author_sort Somsawut Nindam
collection DOAJ
description This research proposes a deep learning method for classifying student behavior in classrooms that follow the professional learning community teaching approach. We collected data on five student activities: hand-raising, interacting, sitting, turning around, and writing. We used the sum of absolute differences (SAD) in the LUV color space to detect scene changes. The K-means algorithm was then applied to select keyframes using the computed SAD. Next, we extracted features using multiple pretrained deep learning models from the convolutional neural network family. The pretrained models considered were InceptionV3, ResNet50V2, VGG16, and EfficientNetB7. We leveraged feature fusion, incorporating optical flow features and data augmentation techniques, to increase the necessary spatial features of selected keyframes. Finally, we classified the students’ behavior using a deep sequence model based on the bidirectional long short-term memory network with an attention mechanism (BiLSTM-AT). The proposed method with the BiLSTM-AT model can recognize behaviors from our dataset with high accuracy, precision, recall, and F1-scores of 0.97, 0.97, and 0.97, respectively. The overall accuracy was 96.67%. This high efficiency demonstrates the potential of the proposed method for classifying student behavior in classrooms.
first_indexed 2024-03-08T15:11:23Z
format Article
id doaj.art-132ff405e8534685bad02003b76dffc4
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-08T15:11:23Z
publishDate 2023-12-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-132ff405e8534685bad02003b76dffc42024-01-10T14:51:25ZengMDPI AGApplied Sciences2076-34172023-12-0114123010.3390/app14010230MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior ClassificationSomsawut Nindam0Seung-Hoon Na1Hyo Jong Lee2Division of Computer Science and Engineering, CAIIT, Jeonbuk National University, Jeonju 54896, Republic of KoreaDivision of Computer Science and Engineering, CAIIT, Jeonbuk National University, Jeonju 54896, Republic of KoreaDivision of Computer Science and Engineering, CAIIT, Jeonbuk National University, Jeonju 54896, Republic of KoreaThis research proposes a deep learning method for classifying student behavior in classrooms that follow the professional learning community teaching approach. We collected data on five student activities: hand-raising, interacting, sitting, turning around, and writing. We used the sum of absolute differences (SAD) in the LUV color space to detect scene changes. The K-means algorithm was then applied to select keyframes using the computed SAD. Next, we extracted features using multiple pretrained deep learning models from the convolutional neural network family. The pretrained models considered were InceptionV3, ResNet50V2, VGG16, and EfficientNetB7. We leveraged feature fusion, incorporating optical flow features and data augmentation techniques, to increase the necessary spatial features of selected keyframes. Finally, we classified the students’ behavior using a deep sequence model based on the bidirectional long short-term memory network with an attention mechanism (BiLSTM-AT). The proposed method with the BiLSTM-AT model can recognize behaviors from our dataset with high accuracy, precision, recall, and F1-scores of 0.97, 0.97, and 0.97, respectively. The overall accuracy was 96.67%. This high efficiency demonstrates the potential of the proposed method for classifying student behavior in classrooms.https://www.mdpi.com/2076-3417/14/1/230lesson studyprofessional learning communitystudent behaviors classificationvideo classificationkeyframe selectionmulti-feature fusion
spellingShingle Somsawut Nindam
Seung-Hoon Na
Hyo Jong Lee
MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
Applied Sciences
lesson study
professional learning community
student behaviors classification
video classification
keyframe selection
multi-feature fusion
title MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_full MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_fullStr MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_full_unstemmed MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_short MultiFusedNet: A Multi-Feature Fused Network of Pretrained Vision Models via Keyframes for Student Behavior Classification
title_sort multifusednet a multi feature fused network of pretrained vision models via keyframes for student behavior classification
topic lesson study
professional learning community
student behaviors classification
video classification
keyframe selection
multi-feature fusion
url https://www.mdpi.com/2076-3417/14/1/230
work_keys_str_mv AT somsawutnindam multifusednetamultifeaturefusednetworkofpretrainedvisionmodelsviakeyframesforstudentbehaviorclassification
AT seunghoonna multifusednetamultifeaturefusednetworkofpretrainedvisionmodelsviakeyframesforstudentbehaviorclassification
AT hyojonglee multifusednetamultifeaturefusednetworkofpretrainedvisionmodelsviakeyframesforstudentbehaviorclassification