A Hierarchical Spatial–Temporal Cross-Attention Scheme for Video Summarization Using Contrastive Learning
Video summarization (VS) is a widely used technique for facilitating the effective reading, fast comprehension, and effective retrieval of video content. Certain properties of the new video data, such as a lack of prominent emphasis and a fuzzy theme development border, disturb the original thinking...
Main Authors: | Xiaoyu Teng, Xiaolin Gui, Pan Xu, Jianglei Tong, Jian An, Yang Liu, Huilan Jiang |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2022-10-01 |
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/22/21/8275 |
Similar Items
- Exploring Global Diversity and Local Context for Video Summarization
  by: Yingchao Pan, et al.
  Published: (2022-01-01)
- Summarization of Spanish Talk Shows with Siamese Hierarchical Attention Networks
  by: J.-A. González, et al.
  Published: (2019-09-01)
- From video summarization to real time video summarization in smart cities and beyond: A survey
  by: Prashant Giridhar Shambharkar, et al.
  Published: (2023-01-01)
- Wanet: weight and attention network for video summarization
  by: Arpan Basu, et al.
  Published: (2024-01-01)
- SUM-GAN-GEA: Video Summarization Using GAN with Gaussian Distribution and External Attention
  by: Qinghao Yu, et al.
  Published: (2022-10-01)