Tencent AVS: A Holistic Ads Video Dataset for Multi-Modal Scene Segmentation
Temporal video segmentation and classification have been advanced greatly by public benchmarks in recent years. However, such research still mainly focuses on human actions, failing to describe videos in a holistic view. In addition, previous research tends to pay much attention to visual informatio...
Main Authors: | Jie Jiang, Zhimin Li, Jiangfeng Xiong, Rongwei Quan, Qinglin Lu, Wei Liu |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2022-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9973306/ |
Similar Items
-
Accelerating Deep Neural Networks for Efficient Scene Understanding in Multi-Modal Automotive Applications
by: Stavros Nousias, et al.
Published: (2023-01-01) -
Short Video Behavior Recognition Combining Scene and Behavior Features
by: DONG Xu, TAN Li, ZHOU Lina, SONG Yanyan
Published: (2020-10-01) -
Mutual Complementarity: Multi-Modal Enhancement Semantic Learning for Micro-Video Scene Recognition
by: Jie Guo, et al.
Published: (2020-01-01) -
VidOnt: a core reference ontology for reasoning over video scenes
by: Leslie F. Sikos
Published: (2018-04-01) -
Contrasting Multi-Modal Similarity Framework for Video Scene Segmentation
by: Jinwoo Park, et al.
Published: (2024-01-01)