LASFormer: Light Transformer for Action Segmentation with Receptive Field-Guided Distillation and Action Relation Encoding

Transformer-based models for action segmentation have achieved high frame-wise accuracy on challenging benchmarks. However, they rely on multiple decoders and self-attention blocks to obtain informative representations, and the resulting huge computing and memory costs remain an obstacle to handling long video s...

Bibliographic Details
Main Authors: Zhichao Ma, Kan Li
Format: Article
Language: English
Published: MDPI AG 2023-12-01
Series: Mathematics
Online Access: https://www.mdpi.com/2227-7390/12/1/57