End-to-end learning, and audio-visual human-centric video understanding

End-to-end learning, and audio-visual human-centric video understanding

<p>The field of machine learning has seen tremendous progress in the last decade, largely due to the advent of deep neural networks. When trained on large-scale labelled datasets, these machine learning algorithms can learn powerful semantic representations directly from the input data, end-to...

Description complète

Détails bibliographiques
Auteur principal:	Brown, A
Autres auteurs:	Zisserman, A
Format:	Thèse
Langue:	English
Publié:	2022
Sujets:	Machine learning Deep learning (Machine learning) Computer vision

Documents similaires

Sign language understanding using multimodal learning
par: Momeni, L
Publié: (2024)

Video understanding using multimodal deep learning
par: Nagrani, A
Publié: (2020)

Deep vision for indoor understanding and localisation
par: Howard-Jenkins, H
Publié: (2022)

END TO END LEARNING FOR A DRIVING SIMULATOR
par: V. F. Alexeev, et autres
Publié: (2019-06-01)

Self-Supervised Learning for Audio-Visual Relationships of Videos With Stereo Sounds
par: Tomoya Sato, et autres
Publié: (2022-01-01)

Towards unified visual perception
par: Sun, S
Publié: (2024)

Learning to understand large-scale 3D point clouds
par: Qingyong, H
Publié: (2022)

Understanding video through the lens of language
par: Bain, M
Publié: (2023)

Learning object-centric representations
par: Kosiorek, AR
Publié: (2019)

Self-supervised video representation learning
par: Han, T
Publié: (2022)

Self-supervised and cross-modal learning from videos
par: Koepke, AS
Publié: (2019)

Learning with multimodal self-supervision
par: Chen, H
Publié: (2021)

Learning dense prediction: from correspondence to segmentation
par: Zhang, F
Publié: (2022)

Audio-visual deep learning
par: Afouras, T, et autres
Publié: (2021)

Visual recognition in art using machine learning
par: Crowley, E
Publié: (2017)

Machine Learning and End-to-End Deep Learning for Monitoring Driver Distractions From Physiological and Visual Signals
par: Martin Gjoreski, et autres
Publié: (2020-01-01)

Self-supervised learning using motion and visualizing convolutional neural networks
par: Mahendran, A
Publié: (2018)

Learning Non-Parametric Surrogate Losses With Correlated Gradients
par: Seungdong Yoa, et autres
Publié: (2021-01-01)

Towards diverse generation and reliable classification using neural networks
par: Kulharia, V
Publié: (2022)

Developing object perception in the low data regime
par: Kaul, P
Publié: (2024)

Measuring the security of computer vision systems to adversarial attacks
par: Lovisotto, G
Publié: (2022)

An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
par: Micheal Dutt, et autres
Publié: (2024-01-01)

Learning visual concepts with fewer human annotations
par: Ehrhardt, S
Publié: (2020)

Machine learning for audio, image and video analysis : theory and applications /
par: Camastra, Francesco, 1960-, et autres
Publié: (2008)

End-to-End, Pixel-Wise Vessel-Specific Coronary and Aortic Calcium Detection and Scoring Using Deep Learning
par: Gurpreet Singh, et autres
Publié: (2021-02-01)

Fusion of Visual and Audio Signals for Wildlife Surveillance
par: Cheng Hao Ng, et autres
Publié: (2022-11-01)

Unsupervised learning of clutter-resistant visual representations from natural videos
par: Liao, Qianli, et autres
Publié: (2015)

Advancing machine learning in astrophysics
par: Walmsley, M
Publié: (2021)

Robust 2D and 3D registration with deep neural networks
par: Wang, Z
Publié: (2024)

Holistic image understanding with deep learning and dense random fields
par: Zheng, S
Publié: (2016)

End-to-end DVB-S2X system design with deep learning-based channel estimation over satellite fading channels
par: Mfarej, Sumaya Dhari Awad
Publié: (2021)

A Survey on Audio-Video Based Defect Detection Through Deep Learning in Railway Maintenance
par: Lorenzo De Donato, et autres
Publié: (2022-01-01)

Deep Learning Architecture Reduction for fMRI Data
par: Ruben Alvarez-Gonzalez, et autres
Publié: (2022-02-01)

Using Deep Neural Networks for Human Fall Detection Based on Pose Estimation
par: Mohammadamin Salimi, et autres
Publié: (2022-06-01)

Reinforcement learning for rule selection in end to end differentiable proving
par: Morris, M
Publié: (2021)

On the Use of Deep Reinforcement Learning for Visual Tracking: A Survey
par: Giorgio Cruciata, et autres
Publié: (2021-01-01)

Deep learning in computer vision: A critical review of emerging techniques and application scenarios
par: Junyi Chai, et autres
Publié: (2021-12-01)

Corrigendum: Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks
par: Jordan R. Ubbens, et autres
Publié: (2018-01-01)

Automatic plant phenotyping analysis of Melon (Cucumis melo L.) germplasm resources using deep learning methods and computer vision
par: Shan Xu, et autres
Publié: (2024-10-01)

Looking deep at people: towards understanding and generating humans in images with deep learning
par: de Bem, RA
Publié: (2018)