A Hybrid Transformer-LSTM Model With 3D Separable Convolution for Video Prediction

Video prediction is an essential vision task due to its wide applications in real-world scenarios. However, it is indeed challenging due to the inherent uncertainty and complex spatiotemporal dynamics of video content. Several state-of-the-art deep learning methods have achieved superior video predi...

Full description

Bibliographic Details
Main Authors:	Mareeta Mathai, Ying Liu, Nam Ling
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	3D separable convolution deep learning depthwise convolution LSTM pointwise convolution self-attention
Online Access:	https://ieeexplore.ieee.org/document/10464302/

Internet

https://ieeexplore.ieee.org/document/10464302/

A Hybrid Transformer-LSTM Model With 3D Separable Convolution for Video Prediction

Internet

Similar Items