Spectral Representation Learning and Fusion for Autonomous Vehicles Trip Description Exploiting Recurrent Transformer

A thorough analysis and comprehension of the entire cue set in visual data are indispensable for an ideal video description model. As outlined in recent algorithm proposals, video descriptions have primarily been generated by learning RGB and optical flow representations rather than exploring and in...

Bibliographic Details
Main Authors: Ghazala Rafiq, Muhammad Rafiq, Gyu Sang Choi
Format: Article
Language: English
Published: IEEE 2023-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10155442/