Video understanding using multimodal deep learning

Our experience of the world is multimodal; however, deep learning networks have traditionally been designed for and trained on unimodal inputs such as images, audio segments or text. In this thesis we develop strategies to exploit multimodal information (in the form of vision, text, speech a...

Bibliographic details
Main author:	Nagrani, A
Other authors:	Zisserman, A
Format:	Thesis
Language:	English
Published:	2020
Subjects:	Computer Vision, Machine Learning

Similar items

Sign language understanding using multimodal learning
By: Momeni, L
Published: (2024)

Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
By: Adam Bielski, et al.
Published: (2018-01-01)

End-to-end learning, and audio-visual human-centric video understanding
By: Brown, A
Published: (2022)

Holistic image understanding with deep learning and dense random fields
By: Zheng, S
Published: (2016)

Learning with multimodal self-supervision
By: Chen, H
Published: (2021)

Self-supervised video representation learning
By: Han, T
Published: (2022)

Self-supervised and cross-modal learning from videos
By: Koepke, AS
Published: (2019)

Deep vision for indoor understanding and localisation
By: Howard-Jenkins, H
Published: (2022)

Understanding video through the lens of language
By: Bain, M
Published: (2023)

Pixel-level scene understanding with deep structured models
By: Arnab, A
Published: (2019)

Deep Vision Multimodal Learning: Methodology, Benchmark, and Trend
By: Wenhao Chai, et al.
Published: (2022-06-01)

Looking deep at people: towards understanding and generating humans in images with deep learning
By: de Bem, RA
Published: (2018)

Learning to understand large-scale 3D point clouds
By: Qingyong, H
Published: (2022)

Self-supervised learning using motion and visualizing convolutional neural networks
By: Mahendran, A
Published: (2018)

Visual recognition in art using machine learning
By: Crowley, E
Published: (2017)

DEEP LEARNING-BASED MODEL FOR CLASSIFICATION OF BEAN NITROGEN STATUS USING DIGITAL CANOPY IMAGING
By: Murilo M. Baesso, et al.
Published: (2023-06-01)

Seeing What You’re Told: Sentence-Guided Activity Recognition In Video
By: Siddharth, Narayanaswamy, et al.
Published: (2015)

An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
By: Micheal Dutt, et al.
Published: (2024-01-01)

A Survey on Audio-Video Based Defect Detection Through Deep Learning in Railway Maintenance
By: Lorenzo De Donato, et al.
Published: (2022-01-01)

On the Generalization of Deep Learning Models in Video Deepfake Detection
By: Davide Alessandro Coccomini, et al.
Published: (2023-04-01)

Automatic Detection for Acromegaly Using Hand Photographs: A Deep-Learning Approach
By: Chengbin Duan, et al.
Published: (2021-01-01)

HyMNet: A Multimodal Deep Learning System for Hypertension Prediction Using Fundus Images and Cardiometabolic Risk Factors
By: Mohammed Baharoon, et al.
Published: (2024-10-01)

Corrigendum: Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks
By: Jordan R. Ubbens, et al.
Published: (2018-01-01)

Scalable learning for expanding robot vision
By: Porav, H
Published: (2020)

Robust 2D and 3D registration with deep neural networks
By: Wang, Z
Published: (2024)

Learning shape from images
By: Wiles, O
Published: (2020)

Unsupervised learning of clutter-resistant visual representations from natural videos
By: Liao, Qianli, et al.
Published: (2015)

Challenges and Applications for Implementing Machine Learning in Computer Vision
By: Kashyap, Ramgopal, 1984- editor., et al.
Published: ([202)

Understanding Mixup Training Methods
By: Daojun Liang, et al.
Published: (2018-01-01)

Multimodal Image-Based Indoor Localization with Machine Learning—A Systematic Review
By: Szymon Łukasik, et al.
Published: (2024-09-01)

Structured learning and prediction in computer vision
By: Nowozin, Sebastian, et al.
Published: (2011)

A Dataset of apical periodontitis lesions in panoramic radiographs for deep-learning-based classification and detection
By: Hoang Viet Do, et al.
Published: (2024-06-01)

Use and examination of convolutional neural networks for scene understanding
By: Jetley, S
Published: (2018)

Deep Learning Architecture Reduction for fMRI Data
By: Ruben Alvarez-Gonzalez, et al.
Published: (2022-02-01)

Unsupervised learning of 3D objects in the wild
By: Wu, S
Published: (2022)

Deep learning based computer vision approaches for smart agricultural applications
By: V.G. Dhanya, et al.
Published: (2022-01-01)

Classification of protected grassland habitats using deep learning architectures on Sentinel-2 satellite imagery data
By: Gabriel Díaz-Ireland, et al.
Published: (2024-11-01)

Computer vision and machine learning with RGB-D sensors
By: Shao, Ling
Published: (c201)

Multimodal Deep Learning Integration of Image, Weather, and Phenotypic Data Under Temporal Effects for Early Prediction of Maize Yield
By: Danial Shamsuddin, et al.
Published: (2024-10-01)

Weakly-supervised learning for video understanding
By: Deng, Dingfan
Published: (2023)