Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision

Weakly supervised video anomaly detection is a methodology that assesses anomaly levels in individual frames based on labeled video data. Anomaly scores are computed by evaluating the deviation of distances derived from frames in an unbiased state. Weakly supervised video anomaly detection encounter...

Full description

Bibliographic Details
Main Authors: Junyeop Lee, Hyunbon Koo, Seongjun Kim, Hanseok Ko
Format: Article
Language:English
Published: MDPI AG 2023-12-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/24/1/58
_version_ 1827384475123712000
author Junyeop Lee
Hyunbon Koo
Seongjun Kim
Hanseok Ko
author_facet Junyeop Lee
Hyunbon Koo
Seongjun Kim
Hanseok Ko
author_sort Junyeop Lee
collection DOAJ
description Weakly supervised video anomaly detection is a methodology that assesses anomaly levels in individual frames based on labeled video data. Anomaly scores are computed by evaluating the deviation of distances derived from frames in an unbiased state. Weakly supervised video anomaly detection encounters the formidable challenge of false alarms, stemming from various sources, with a major contributor being the inadequate reflection of frame labels during the learning process. Multiple instance learning has been a pivotal solution to this issue in previous studies, necessitating the identification of discernible features between abnormal and normal segments. Simultaneously, it is imperative to identify shared biases within the feature space and cultivate a representative model. In this study, we introduce a novel multiple instance learning framework anchored on a memory unit, which augments features based on memory and effectively bridges the gap between normal and abnormal instances. This augmentation is facilitated through the integration of an multi-head attention feature augmentation module and loss function with a KL divergence and a Gaussian distribution estimation-based approach. The method identifies distinguishable features and secures the inter-instance distance, thus fortifying the distance metrics between abnormal and normal instances approximated by distribution. The contribution of this research involves proposing a novel framework based on MIL for performing WSVAD and presenting an efficient integration strategy during the augmentation process. Extensive experiments were conducted on benchmark datasets XD-Violence and UCF-Crime to substantiate the effectiveness of the proposed model.
first_indexed 2024-03-08T14:58:21Z
format Article
id doaj.art-5bb796e5afaf4706a7b0be35c63e86aa
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-08T14:58:21Z
publishDate 2023-12-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-5bb796e5afaf4706a7b0be35c63e86aa2024-01-10T15:08:23ZengMDPI AGSensors1424-82202023-12-012415810.3390/s24010058Cognitive Refined Augmentation for Video Anomaly Detection in Weak SupervisionJunyeop Lee0Hyunbon Koo1Seongjun Kim2Hanseok Ko3School of Electrical Engineering, Korea University, Seoul 02841, Republic of KoreaKorea Institute of Civil Engineering and Building Technology, Goyang-si 10223, Republic of KoreaKorea Institute of Civil Engineering and Building Technology, Goyang-si 10223, Republic of KoreaSchool of Electrical Engineering, Korea University, Seoul 02841, Republic of KoreaWeakly supervised video anomaly detection is a methodology that assesses anomaly levels in individual frames based on labeled video data. Anomaly scores are computed by evaluating the deviation of distances derived from frames in an unbiased state. Weakly supervised video anomaly detection encounters the formidable challenge of false alarms, stemming from various sources, with a major contributor being the inadequate reflection of frame labels during the learning process. Multiple instance learning has been a pivotal solution to this issue in previous studies, necessitating the identification of discernible features between abnormal and normal segments. Simultaneously, it is imperative to identify shared biases within the feature space and cultivate a representative model. In this study, we introduce a novel multiple instance learning framework anchored on a memory unit, which augments features based on memory and effectively bridges the gap between normal and abnormal instances. This augmentation is facilitated through the integration of an multi-head attention feature augmentation module and loss function with a KL divergence and a Gaussian distribution estimation-based approach. The method identifies distinguishable features and secures the inter-instance distance, thus fortifying the distance metrics between abnormal and normal instances approximated by distribution. The contribution of this research involves proposing a novel framework based on MIL for performing WSVAD and presenting an efficient integration strategy during the augmentation process. Extensive experiments were conducted on benchmark datasets XD-Violence and UCF-Crime to substantiate the effectiveness of the proposed model.https://www.mdpi.com/1424-8220/24/1/58weakly supervised video anomaly detectionfeature augmentationmultiple instance learning
spellingShingle Junyeop Lee
Hyunbon Koo
Seongjun Kim
Hanseok Ko
Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision
Sensors
weakly supervised video anomaly detection
feature augmentation
multiple instance learning
title Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision
title_full Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision
title_fullStr Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision
title_full_unstemmed Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision
title_short Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision
title_sort cognitive refined augmentation for video anomaly detection in weak supervision
topic weakly supervised video anomaly detection
feature augmentation
multiple instance learning
url https://www.mdpi.com/1424-8220/24/1/58
work_keys_str_mv AT junyeoplee cognitiverefinedaugmentationforvideoanomalydetectioninweaksupervision
AT hyunbonkoo cognitiverefinedaugmentationforvideoanomalydetectioninweaksupervision
AT seongjunkim cognitiverefinedaugmentationforvideoanomalydetectioninweaksupervision
AT hanseokko cognitiverefinedaugmentationforvideoanomalydetectioninweaksupervision