A Hybrid Transformer-LSTM Model With 3D Separable Convolution for Video Prediction

Video prediction is an essential vision task with wide applications in real-world scenarios. However, it is challenging because of the inherent uncertainty and complex spatiotemporal dynamics of video content. Several state-of-the-art deep learning methods have achieved superior video predi...

Bibliographic Details
Main Authors: Mareeta Mathai, Ying Liu, Nam Ling
Format: Article
Language: English
Published: IEEE 2024-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10464302/