Video understanding using multimodal deep learning

Video understanding using multimodal deep learning

<p>Our experience of the world is multimodal, however deep learning networks have been traditionally designed for and trained on unimodal inputs such as images, audio segments or text. In this thesis we develop strategies to exploit multimodal information (in the form of vision, text, speech a...

Podrobná bibliografie
Hlavní autor:	Nagrani, A
Další autoři:	Zisserman, A
Médium:	Diplomová práce
Jazyk:	English
Vydáno:	2020
Témata:	Computer Vision Machine Learning

Podobné jednotky

Sign language understanding using multimodal learning
Autor: Momeni, L
Vydáno: (2024)

Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
Autor: Adam Bielski, a další
Vydáno: (2018-01-01)

End-to-end learning, and audio-visual human-centric video understanding
Autor: Brown, A
Vydáno: (2022)

Holistic image understanding with deep learning and dense random fields
Autor: Zheng, S
Vydáno: (2016)

Learning with multimodal self-supervision
Autor: Chen, H
Vydáno: (2021)

Self-supervised video representation learning
Autor: Han, T
Vydáno: (2022)

Self-supervised and cross-modal learning from videos
Autor: Koepke, AS
Vydáno: (2019)

Deep vision for indoor understanding and localisation
Autor: Howard-Jenkins, H
Vydáno: (2022)

Understanding video through the lens of language
Autor: Bain, M
Vydáno: (2023)

Pixel-level scene understanding with deep structured models
Autor: Arnab, A
Vydáno: (2019)

Deep Vision Multimodal Learning: Methodology, Benchmark, and Trend
Autor: Wenhao Chai, a další
Vydáno: (2022-06-01)

Looking deep at people: towards understanding and generating humans in images with deep learning
Autor: de Bem, RA
Vydáno: (2018)

Learning to understand large-scale 3D point clouds
Autor: Qingyong, H
Vydáno: (2022)

Self-supervised learning using motion and visualizing convolutional neural networks
Autor: Mahendran, A
Vydáno: (2018)

Visual recognition in art using machine learning
Autor: Crowley, E
Vydáno: (2017)

DEEP LEARNING-BASED MODEL FOR CLASSIFICATION OF BEAN NITROGEN STATUS USING DIGITAL CANOPY IMAGING
Autor: Murilo M. Baesso, a další
Vydáno: (2023-06-01)

Seeing What You’re Told: Sentence-Guided Activity Recognition In Video
Autor: Siddharth, Narayanaswamy, a další
Vydáno: (2015)

An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
Autor: Micheal Dutt, a další
Vydáno: (2024-01-01)

A Survey on Audio-Video Based Defect Detection Through Deep Learning in Railway Maintenance
Autor: Lorenzo De Donato, a další
Vydáno: (2022-01-01)

On the Generalization of Deep Learning Models in Video Deepfake Detection
Autor: Davide Alessandro Coccomini, a další
Vydáno: (2023-04-01)

Automatic Detection for Acromegaly Using Hand Photographs: A Deep-Learning Approach
Autor: Chengbin Duan, a další
Vydáno: (2021-01-01)

HyMNet: A Multimodal Deep Learning System for Hypertension Prediction Using Fundus Images and Cardiometabolic Risk Factors
Autor: Mohammed Baharoon, a další
Vydáno: (2024-10-01)

Corrigendum: Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks
Autor: Jordan R. Ubbens, a další
Vydáno: (2018-01-01)

Scalable learning for expanding robot vision
Autor: Porav, H
Vydáno: (2020)

Robust 2D and 3D registration with deep neural networks
Autor: Wang, Z
Vydáno: (2024)

Learning shape from images
Autor: Wiles, O
Vydáno: (2020)

Unsupervised learning of clutter-resistant visual representations from natural videos
Autor: Liao, Qianli, a další
Vydáno: (2015)

Challenges and Applications for Implementing Machine Learning in Computer Vision /
Autor: Kashyap, Ramgopal, 1984- editor., a další
Vydáno: ([202)

Understanding Mixup Training Methods
Autor: Daojun Liang, a další
Vydáno: (2018-01-01)

Multimodal Image-Based Indoor Localization with Machine Learning—A Systematic Review
Autor: Szymon Łukasik, a další
Vydáno: (2024-09-01)

Structured learning and prediction in computer vision /
Autor: 525432 Nowozin, Sebastian, a další
Vydáno: (2011)

A Dataset of apical periodontitis lesions in panoramic radiographs for deep-learning-based classification and detection
Autor: Hoang Viet Do, a další
Vydáno: (2024-06-01)

Use and examination of convolutional neural networks for scene understanding
Autor: Jetley, S
Vydáno: (2018)

Deep Learning Architecture Reduction for fMRI Data
Autor: Ruben Alvarez-Gonzalez, a další
Vydáno: (2022-02-01)

Unsupervised learning of 3d objects in the wild
Autor: Wu, S
Vydáno: (2022)

Deep learning based computer vision approaches for smart agricultural applications
Autor: V.G. Dhanya, a další
Vydáno: (2022-01-01)

Classification of protected grassland habitats using deep learning architectures on Sentinel-2 satellite imagery data
Autor: Gabriel Díaz-Ireland, a další
Vydáno: (2024-11-01)

Computer vision and machine learning with RGB-D sensors /
Autor: Shao, Ling
Vydáno: (c201)

Multimodal Deep Learning Integration of Image, Weather, and Phenotypic Data Under Temporal Effects for Early Prediction of Maize Yield
Autor: Danial Shamsuddin, a další
Vydáno: (2024-10-01)

Weakly-supervised learning for video understanding
Autor: Deng, Dingfan
Vydáno: (2023)