Video Scene Detection Using Transformer Encoding Linker Network (TELNet)

This paper introduces a transformer encoding linker network (TELNet) for automatically identifying scene boundaries in videos without prior knowledge of their structure. Videos consist of sequences of semantically related shots or chapters, and recognizing scene boundaries is crucial for various vid...

Full description

Bibliographic Details
Main Authors: Shu-Ming Tseng, Zhi-Ting Yeh, Chia-Yang Wu, Jia-Bin Chang, Mehdi Norouzi
Format: Article
Language:English
Published: MDPI AG 2023-08-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/23/16/7050