Video understanding using multimodal deep learning

Video understanding using multimodal deep learning

<p>Our experience of the world is multimodal, however deep learning networks have been traditionally designed for and trained on unimodal inputs such as images, audio segments or text. In this thesis we develop strategies to exploit multimodal information (in the form of vision, text, speech a...

Повний опис

Бібліографічні деталі
Автор:	Nagrani, A
Інші автори:	Zisserman, A
Формат:	Дисертація
Мова:	English
Опубліковано:	2020
Предмети:	Computer Vision Machine Learning

Схожі ресурси

Sign language understanding using multimodal learning
за авторством: Momeni, L
Опубліковано: (2024)

Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
за авторством: Adam Bielski, та інші
Опубліковано: (2018-01-01)

End-to-end learning, and audio-visual human-centric video understanding
за авторством: Brown, A
Опубліковано: (2022)

Holistic image understanding with deep learning and dense random fields
за авторством: Zheng, S
Опубліковано: (2016)

Learning with multimodal self-supervision
за авторством: Chen, H
Опубліковано: (2021)

Self-supervised video representation learning
за авторством: Han, T
Опубліковано: (2022)

Self-supervised and cross-modal learning from videos
за авторством: Koepke, AS
Опубліковано: (2019)

Deep vision for indoor understanding and localisation
за авторством: Howard-Jenkins, H
Опубліковано: (2022)

Understanding video through the lens of language
за авторством: Bain, M
Опубліковано: (2023)

Pixel-level scene understanding with deep structured models
за авторством: Arnab, A
Опубліковано: (2019)

Deep Vision Multimodal Learning: Methodology, Benchmark, and Trend
за авторством: Wenhao Chai, та інші
Опубліковано: (2022-06-01)

Looking deep at people: towards understanding and generating humans in images with deep learning
за авторством: de Bem, RA
Опубліковано: (2018)

Learning to understand large-scale 3D point clouds
за авторством: Qingyong, H
Опубліковано: (2022)

Self-supervised learning using motion and visualizing convolutional neural networks
за авторством: Mahendran, A
Опубліковано: (2018)

Visual recognition in art using machine learning
за авторством: Crowley, E
Опубліковано: (2017)

DEEP LEARNING-BASED MODEL FOR CLASSIFICATION OF BEAN NITROGEN STATUS USING DIGITAL CANOPY IMAGING
за авторством: Murilo M. Baesso, та інші
Опубліковано: (2023-06-01)

Seeing What You’re Told: Sentence-Guided Activity Recognition In Video
за авторством: Siddharth, Narayanaswamy, та інші
Опубліковано: (2015)

An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
за авторством: Micheal Dutt, та інші
Опубліковано: (2024-01-01)

A Survey on Audio-Video Based Defect Detection Through Deep Learning in Railway Maintenance
за авторством: Lorenzo De Donato, та інші
Опубліковано: (2022-01-01)

On the Generalization of Deep Learning Models in Video Deepfake Detection
за авторством: Davide Alessandro Coccomini, та інші
Опубліковано: (2023-04-01)

Automatic Detection for Acromegaly Using Hand Photographs: A Deep-Learning Approach
за авторством: Chengbin Duan, та інші
Опубліковано: (2021-01-01)

HyMNet: A Multimodal Deep Learning System for Hypertension Prediction Using Fundus Images and Cardiometabolic Risk Factors
за авторством: Mohammed Baharoon, та інші
Опубліковано: (2024-10-01)

Corrigendum: Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks
за авторством: Jordan R. Ubbens, та інші
Опубліковано: (2018-01-01)

Scalable learning for expanding robot vision
за авторством: Porav, H
Опубліковано: (2020)

Robust 2D and 3D registration with deep neural networks
за авторством: Wang, Z
Опубліковано: (2024)

Learning shape from images
за авторством: Wiles, O
Опубліковано: (2020)

Unsupervised learning of clutter-resistant visual representations from natural videos
за авторством: Liao, Qianli, та інші
Опубліковано: (2015)

Challenges and Applications for Implementing Machine Learning in Computer Vision /
за авторством: Kashyap, Ramgopal, 1984- editor., та інші
Опубліковано: ([202)

Understanding Mixup Training Methods
за авторством: Daojun Liang, та інші
Опубліковано: (2018-01-01)

Multimodal Image-Based Indoor Localization with Machine Learning—A Systematic Review
за авторством: Szymon Łukasik, та інші
Опубліковано: (2024-09-01)

Structured learning and prediction in computer vision /
за авторством: 525432 Nowozin, Sebastian, та інші
Опубліковано: (2011)

A Dataset of apical periodontitis lesions in panoramic radiographs for deep-learning-based classification and detection
за авторством: Hoang Viet Do, та інші
Опубліковано: (2024-06-01)

Use and examination of convolutional neural networks for scene understanding
за авторством: Jetley, S
Опубліковано: (2018)

Deep Learning Architecture Reduction for fMRI Data
за авторством: Ruben Alvarez-Gonzalez, та інші
Опубліковано: (2022-02-01)

Unsupervised learning of 3d objects in the wild
за авторством: Wu, S
Опубліковано: (2022)

Deep learning based computer vision approaches for smart agricultural applications
за авторством: V.G. Dhanya, та інші
Опубліковано: (2022-01-01)

Classification of protected grassland habitats using deep learning architectures on Sentinel-2 satellite imagery data
за авторством: Gabriel Díaz-Ireland, та інші
Опубліковано: (2024-11-01)

Computer vision and machine learning with RGB-D sensors /
за авторством: Shao, Ling
Опубліковано: (c201)

Multimodal Deep Learning Integration of Image, Weather, and Phenotypic Data Under Temporal Effects for Early Prediction of Maize Yield
за авторством: Danial Shamsuddin, та інші
Опубліковано: (2024-10-01)

Weakly-supervised learning for video understanding
за авторством: Deng, Dingfan
Опубліковано: (2023)