End-to-end learning, and audio-visual human-centric video understanding

End-to-end learning, and audio-visual human-centric video understanding

<p>The field of machine learning has seen tremendous progress in the last decade, largely due to the advent of deep neural networks. When trained on large-scale labelled datasets, these machine learning algorithms can learn powerful semantic representations directly from the input data, end-to...

Descripció completa

Dades bibliogràfiques
Autor principal:	Brown, A
Altres autors:	Zisserman, A
Format:	Thesis
Idioma:	English
Publicat:	2022
Matèries:	Machine learning Deep learning (Machine learning) Computer vision

Ítems similars

Sign language understanding using multimodal learning
per: Momeni, L
Publicat: (2024)

Video understanding using multimodal deep learning
per: Nagrani, A
Publicat: (2020)

Deep vision for indoor understanding and localisation
per: Howard-Jenkins, H
Publicat: (2022)

END TO END LEARNING FOR A DRIVING SIMULATOR
per: V. F. Alexeev, et al.
Publicat: (2019-06-01)

Self-Supervised Learning for Audio-Visual Relationships of Videos With Stereo Sounds
per: Tomoya Sato, et al.
Publicat: (2022-01-01)

Towards unified visual perception
per: Sun, S
Publicat: (2024)

Learning to understand large-scale 3D point clouds
per: Qingyong, H
Publicat: (2022)

Understanding video through the lens of language
per: Bain, M
Publicat: (2023)

Learning object-centric representations
per: Kosiorek, AR
Publicat: (2019)

Self-supervised video representation learning
per: Han, T
Publicat: (2022)

Self-supervised and cross-modal learning from videos
per: Koepke, AS
Publicat: (2019)

Learning with multimodal self-supervision
per: Chen, H
Publicat: (2021)

Learning dense prediction: from correspondence to segmentation
per: Zhang, F
Publicat: (2022)

Audio-visual deep learning
per: Afouras, T, et al.
Publicat: (2021)

Visual recognition in art using machine learning
per: Crowley, E
Publicat: (2017)

Machine Learning and End-to-End Deep Learning for Monitoring Driver Distractions From Physiological and Visual Signals
per: Martin Gjoreski, et al.
Publicat: (2020-01-01)

Self-supervised learning using motion and visualizing convolutional neural networks
per: Mahendran, A
Publicat: (2018)

Learning Non-Parametric Surrogate Losses With Correlated Gradients
per: Seungdong Yoa, et al.
Publicat: (2021-01-01)

Towards diverse generation and reliable classification using neural networks
per: Kulharia, V
Publicat: (2022)

Developing object perception in the low data regime
per: Kaul, P
Publicat: (2024)

Measuring the security of computer vision systems to adversarial attacks
per: Lovisotto, G
Publicat: (2022)

An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
per: Micheal Dutt, et al.
Publicat: (2024-01-01)

Learning visual concepts with fewer human annotations
per: Ehrhardt, S
Publicat: (2020)

Machine learning for audio, image and video analysis : theory and applications /
per: Camastra, Francesco, 1960-, et al.
Publicat: (2008)

End-to-End, Pixel-Wise Vessel-Specific Coronary and Aortic Calcium Detection and Scoring Using Deep Learning
per: Gurpreet Singh, et al.
Publicat: (2021-02-01)

Fusion of Visual and Audio Signals for Wildlife Surveillance
per: Cheng Hao Ng, et al.
Publicat: (2022-11-01)

Unsupervised learning of clutter-resistant visual representations from natural videos
per: Liao, Qianli, et al.
Publicat: (2015)

Advancing machine learning in astrophysics
per: Walmsley, M
Publicat: (2021)

Robust 2D and 3D registration with deep neural networks
per: Wang, Z
Publicat: (2024)

Holistic image understanding with deep learning and dense random fields
per: Zheng, S
Publicat: (2016)

End-to-end DVB-S2X system design with deep learning-based channel estimation over satellite fading channels
per: Mfarej, Sumaya Dhari Awad
Publicat: (2021)

A Survey on Audio-Video Based Defect Detection Through Deep Learning in Railway Maintenance
per: Lorenzo De Donato, et al.
Publicat: (2022-01-01)

Deep Learning Architecture Reduction for fMRI Data
per: Ruben Alvarez-Gonzalez, et al.
Publicat: (2022-02-01)

Using Deep Neural Networks for Human Fall Detection Based on Pose Estimation
per: Mohammadamin Salimi, et al.
Publicat: (2022-06-01)

Reinforcement learning for rule selection in end to end differentiable proving
per: Morris, M
Publicat: (2021)

On the Use of Deep Reinforcement Learning for Visual Tracking: A Survey
per: Giorgio Cruciata, et al.
Publicat: (2021-01-01)

Deep learning in computer vision: A critical review of emerging techniques and application scenarios
per: Junyi Chai, et al.
Publicat: (2021-12-01)

Corrigendum: Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks
per: Jordan R. Ubbens, et al.
Publicat: (2018-01-01)

Automatic plant phenotyping analysis of Melon (Cucumis melo L.) germplasm resources using deep learning methods and computer vision
per: Shan Xu, et al.
Publicat: (2024-10-01)

Looking deep at people: towards understanding and generating humans in images with deep learning
per: de Bem, RA
Publicat: (2018)