Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision
Weakly supervised video anomaly detection is a methodology that assesses anomaly levels in individual frames based on labeled video data. Anomaly scores are computed by evaluating the deviation of distances derived from frames in an unbiased state. Weakly supervised video anomaly detection encounter...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-12-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/24/1/58 |
_version_ | 1827384475123712000 |
---|---|
author | Junyeop Lee Hyunbon Koo Seongjun Kim Hanseok Ko |
author_facet | Junyeop Lee Hyunbon Koo Seongjun Kim Hanseok Ko |
author_sort | Junyeop Lee |
collection | DOAJ |
description | Weakly supervised video anomaly detection is a methodology that assesses anomaly levels in individual frames based on labeled video data. Anomaly scores are computed by evaluating the deviation of distances derived from frames in an unbiased state. Weakly supervised video anomaly detection encounters the formidable challenge of false alarms, stemming from various sources, with a major contributor being the inadequate reflection of frame labels during the learning process. Multiple instance learning has been a pivotal solution to this issue in previous studies, necessitating the identification of discernible features between abnormal and normal segments. Simultaneously, it is imperative to identify shared biases within the feature space and cultivate a representative model. In this study, we introduce a novel multiple instance learning framework anchored on a memory unit, which augments features based on memory and effectively bridges the gap between normal and abnormal instances. This augmentation is facilitated through the integration of an multi-head attention feature augmentation module and loss function with a KL divergence and a Gaussian distribution estimation-based approach. The method identifies distinguishable features and secures the inter-instance distance, thus fortifying the distance metrics between abnormal and normal instances approximated by distribution. The contribution of this research involves proposing a novel framework based on MIL for performing WSVAD and presenting an efficient integration strategy during the augmentation process. Extensive experiments were conducted on benchmark datasets XD-Violence and UCF-Crime to substantiate the effectiveness of the proposed model. |
first_indexed | 2024-03-08T14:58:21Z |
format | Article |
id | doaj.art-5bb796e5afaf4706a7b0be35c63e86aa |
institution | Directory Open Access Journal |
issn | 1424-8220 |
language | English |
last_indexed | 2024-03-08T14:58:21Z |
publishDate | 2023-12-01 |
publisher | MDPI AG |
record_format | Article |
series | Sensors |
spelling | doaj.art-5bb796e5afaf4706a7b0be35c63e86aa2024-01-10T15:08:23ZengMDPI AGSensors1424-82202023-12-012415810.3390/s24010058Cognitive Refined Augmentation for Video Anomaly Detection in Weak SupervisionJunyeop Lee0Hyunbon Koo1Seongjun Kim2Hanseok Ko3School of Electrical Engineering, Korea University, Seoul 02841, Republic of KoreaKorea Institute of Civil Engineering and Building Technology, Goyang-si 10223, Republic of KoreaKorea Institute of Civil Engineering and Building Technology, Goyang-si 10223, Republic of KoreaSchool of Electrical Engineering, Korea University, Seoul 02841, Republic of KoreaWeakly supervised video anomaly detection is a methodology that assesses anomaly levels in individual frames based on labeled video data. Anomaly scores are computed by evaluating the deviation of distances derived from frames in an unbiased state. Weakly supervised video anomaly detection encounters the formidable challenge of false alarms, stemming from various sources, with a major contributor being the inadequate reflection of frame labels during the learning process. Multiple instance learning has been a pivotal solution to this issue in previous studies, necessitating the identification of discernible features between abnormal and normal segments. Simultaneously, it is imperative to identify shared biases within the feature space and cultivate a representative model. In this study, we introduce a novel multiple instance learning framework anchored on a memory unit, which augments features based on memory and effectively bridges the gap between normal and abnormal instances. This augmentation is facilitated through the integration of an multi-head attention feature augmentation module and loss function with a KL divergence and a Gaussian distribution estimation-based approach. The method identifies distinguishable features and secures the inter-instance distance, thus fortifying the distance metrics between abnormal and normal instances approximated by distribution. The contribution of this research involves proposing a novel framework based on MIL for performing WSVAD and presenting an efficient integration strategy during the augmentation process. Extensive experiments were conducted on benchmark datasets XD-Violence and UCF-Crime to substantiate the effectiveness of the proposed model.https://www.mdpi.com/1424-8220/24/1/58weakly supervised video anomaly detectionfeature augmentationmultiple instance learning |
spellingShingle | Junyeop Lee Hyunbon Koo Seongjun Kim Hanseok Ko Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision Sensors weakly supervised video anomaly detection feature augmentation multiple instance learning |
title | Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision |
title_full | Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision |
title_fullStr | Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision |
title_full_unstemmed | Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision |
title_short | Cognitive Refined Augmentation for Video Anomaly Detection in Weak Supervision |
title_sort | cognitive refined augmentation for video anomaly detection in weak supervision |
topic | weakly supervised video anomaly detection feature augmentation multiple instance learning |
url | https://www.mdpi.com/1424-8220/24/1/58 |
work_keys_str_mv | AT junyeoplee cognitiverefinedaugmentationforvideoanomalydetectioninweaksupervision AT hyunbonkoo cognitiverefinedaugmentationforvideoanomalydetectioninweaksupervision AT seongjunkim cognitiverefinedaugmentationforvideoanomalydetectioninweaksupervision AT hanseokko cognitiverefinedaugmentationforvideoanomalydetectioninweaksupervision |