Bilingual video captioning model for enhanced video retrieval

Bilingual video captioning model for enhanced video retrieval

Abstract Many video platforms rely on the descriptions that uploaders provide for video retrieval. However, this reliance may cause inaccuracies. Although deep learning-based video captioning can resolve this problem, it has some limitations: (1) traditional keyframe extraction techniques do not con...

Full description

Bibliographic Details
Main Authors:	Norah Alrebdi, Amal A. Al-Shargabi
Format:	Article
Language:	English
Published:	SpringerOpen 2024-01-01
Series:	Journal of Big Data
Subjects:	Artificial intelligence Computer vision Natural language processing Video retrieval English video captioning Arabic video captioning
Online Access:	https://doi.org/10.1186/s40537-024-00878-w

Similar Items

Deep learning and knowledge graph for image/video captioning: A review of datasets, evaluation metrics, and methods
by: Mohammad Saif Wajid, et al.
Published: (2024-01-01)

Step by Step: A Gradual Approach for Dense Video Captioning
by: Wangyu Choi, et al.
Published: (2023-01-01)

Real-time Arabic Video Captioning Using CNN and Transformer Networks Based on Parallel Implementation
by: Adel Jalal Yousif, et al.
Published: (2024-03-01)

Parallel Pathway Dense Video Captioning With Deformable Transformer
by: Wangyu Choi, et al.
Published: (2022-01-01)

PWS-DVC: Enhancing Weakly Supervised Dense Video Captioning With Pretraining Approach
by: Wangyu Choi, et al.
Published: (2023-01-01)

Cross-modal graph with meta concepts for video captioning
by: Wang, Hao, et al.
Published: (2022)

Fusion of Multi-Modal Features to Enhance Dense Video Caption
by: Xuefei Huang, et al.
Published: (2023-06-01)

Parallel Dense Video Caption Generation with Multi-Modal Features
by: Xuefei Huang, et al.
Published: (2023-08-01)

Exploring deep learning approaches for video captioning: A comprehensive review
by: Adel Jalal Yousif, et al.
Published: (2023-12-01)

Video Question-Answering Techniques, Benchmark Datasets and Evaluation Metrics Leveraging Video Captioning: A Comprehensive Survey
by: Khushboo Khurana, et al.
Published: (2021-01-01)

Adaptive Curriculum Learning for Video Captioning
by: Shanhao Li, et al.
Published: (2022-01-01)

CapERA: Captioning Events in Aerial Videos
by: Laila Bashmal, et al.
Published: (2023-04-01)

UAT: Universal Attention Transformer for Video Captioning
by: Heeju Im, et al.
Published: (2022-06-01)

Comparing the effectiveness of explicit EAL feedback through slideshow (text+audio) and captioned video
by: Jonathan Harrison
Published: (2022-04-01)

The why-what-when-who-how of using captioned videos as an instructional aid in EAL classrooms: Theoretical perspectives and classroom implications
by: Trinh Thai Van Phuc
Published: (2022-06-01)

Teaching Medical English through Professional Captioning Videos
by: Džuganová Božena
Published: (2019-09-01)

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates
by: Nicholas Moratelli, et al.
Published: (2023-01-01)

Multi-Task Video Captioning with a Stepwise Multimodal Encoder
by: Zihao Liu, et al.
Published: (2022-08-01)

Video Caption Based Searching Using End-to-End Dense Captioning and Sentence Embeddings
by: Akshay Aggarwal, et al.
Published: (2020-06-01)

Action knowledge for video captioning with graph neural networks
by: Willy Fitra Hendria, et al.
Published: (2023-04-01)

Quality Enhancement Based Video Captioning in Video Communication Systems
by: The Van Le, et al.
Published: (2024-01-01)

Video Captions for Online Courses: Do YouTube’s Auto-generated Captions Meet Deaf Students’ Needs?
by: Becky Sue Parton
Published: (2016-08-01)

Video Captions for Online Courses: Do YouTube’s Auto-generated Captions Meet Deaf Students’ Needs?
by: Becky Sue Parton
Published: (2016-08-01)

MFVC: Urban Traffic Scene Video Caption Based on Multimodal Fusion
by: Mingxing Li, et al.
Published: (2022-09-01)

Empirical autopsy of deep video captioning encoder-decoder architecture
by: Nayyer Aafaq, et al.
Published: (2021-03-01)

Video Description: Datasets & Evaluation Metrics
by: Muhammad Rafiq, et al.
Published: (2021-01-01)

Evaluation metrics for video captioning: A survey
by: Andrei de Souza Inácio, et al.
Published: (2023-09-01)

Video captioning based on vision transformer and reinforcement learning
by: Hong Zhao, et al.
Published: (2022-03-01)

A Fine-Grained Spatial-Temporal Attention Model for Video Captioning
by: An-An Liu, et al.
Published: (2018-01-01)

Automatic Image and Video Caption Generation With Deep Learning: A Concise Review and Algorithmic Overlap
by: Soheyla Amirian, et al.
Published: (2020-01-01)

Video captioning with stacked attention and semantic hard pull
by: Md. Mushfiqur Rahman, et al.
Published: (2021-08-01)

Video Captioning Based on Channel Soft Attention and Semantic Reconstructor
by: Zhou Lei, et al.
Published: (2021-02-01)

Vision-Text Cross-Modal Fusion for Accurate Video Captioning
by: Kaouther Ouenniche, et al.
Published: (2023-01-01)

Semantic-filtered Soft-Split-Aware video captioning with audio-augmented feature
by: Xu, Yuecong, et al.
Published: (2021)

Examining the Educational Benefits of and Attitudes Toward Closed-Captioning Among Undergraduate Students
by: Bryan Dallas, et al.
Published: (2016-04-01)

A Multimodal Framework for Video Caption Generation
by: Reshmi S. Bhooshan, et al.
Published: (2022-01-01)

Image Caption Generation via Unified Retrieval and Generation-Based Method
by: Shanshan Zhao, et al.
Published: (2020-09-01)

DeepRide: Dashcam Video Description Dataset for Autonomous Vehicle Location-Aware Trip Description
by: Ghazala Rafiq, et al.
Published: (2022-01-01)

A Semantics-Assisted Video Captioning Model Trained With Scheduled Sampling
by: Haoran Chen, et al.
Published: (2020-09-01)

Video Captioning With Adaptive Attention and Mixed Loss Optimization
by: Huanhou Xiao, et al.
Published: (2019-01-01)