End-to-end learning, and audio-visual human-centric video understanding

The field of machine learning has seen tremendous progress in the last decade, largely due to the advent of deep neural networks. When trained on large-scale labelled datasets, these machine learning algorithms can learn powerful semantic representations directly from the input data, end-to...

Bibliographic Details
Main Author:	Brown, A
Other Authors:	Zisserman, A
Format:	Thesis
Language:	English
Published:	2022
Subjects:	Machine learning; Deep learning (Machine learning); Computer vision

Similar Items

Sign language understanding using multimodal learning
by: Momeni, L
Published: (2024)

Video understanding using multimodal deep learning
by: Nagrani, A
Published: (2020)

Deep vision for indoor understanding and localisation
by: Howard-Jenkins, H
Published: (2022)

END TO END LEARNING FOR A DRIVING SIMULATOR
by: V. F. Alexeev, et al.
Published: (2019-06-01)

Self-Supervised Learning for Audio-Visual Relationships of Videos With Stereo Sounds
by: Tomoya Sato, et al.
Published: (2022-01-01)

Towards unified visual perception
by: Sun, S
Published: (2024)

Learning to understand large-scale 3D point clouds
by: Qingyong, H
Published: (2022)

Understanding video through the lens of language
by: Bain, M
Published: (2023)

Learning object-centric representations
by: Kosiorek, AR
Published: (2019)

Self-supervised video representation learning
by: Han, T
Published: (2022)

Self-supervised and cross-modal learning from videos
by: Koepke, AS
Published: (2019)

Learning with multimodal self-supervision
by: Chen, H
Published: (2021)

Learning dense prediction: from correspondence to segmentation
by: Zhang, F
Published: (2022)

Audio-visual deep learning
by: Afouras, T, et al.
Published: (2021)

Visual recognition in art using machine learning
by: Crowley, E
Published: (2017)

Machine Learning and End-to-End Deep Learning for Monitoring Driver Distractions From Physiological and Visual Signals
by: Martin Gjoreski, et al.
Published: (2020-01-01)

Self-supervised learning using motion and visualizing convolutional neural networks
by: Mahendran, A
Published: (2018)

Learning Non-Parametric Surrogate Losses With Correlated Gradients
by: Seungdong Yoa, et al.
Published: (2021-01-01)

Towards diverse generation and reliable classification using neural networks
by: Kulharia, V
Published: (2022)

Developing object perception in the low data regime
by: Kaul, P
Published: (2024)

Measuring the security of computer vision systems to adversarial attacks
by: Lovisotto, G
Published: (2022)

An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
by: Micheal Dutt, et al.
Published: (2024-01-01)

Learning visual concepts with fewer human annotations
by: Ehrhardt, S
Published: (2020)

Machine learning for audio, image and video analysis: theory and applications
by: Camastra, Francesco, 1960-, et al.
Published: (2008)

End-to-End, Pixel-Wise Vessel-Specific Coronary and Aortic Calcium Detection and Scoring Using Deep Learning
by: Gurpreet Singh, et al.
Published: (2021-02-01)

Fusion of Visual and Audio Signals for Wildlife Surveillance
by: Cheng Hao Ng, et al.
Published: (2022-11-01)

Unsupervised learning of clutter-resistant visual representations from natural videos
by: Liao, Qianli, et al.
Published: (2015)

Advancing machine learning in astrophysics
by: Walmsley, M
Published: (2021)

Robust 2D and 3D registration with deep neural networks
by: Wang, Z
Published: (2024)

Holistic image understanding with deep learning and dense random fields
by: Zheng, S
Published: (2016)

End-to-end DVB-S2X system design with deep learning-based channel estimation over satellite fading channels
by: Mfarej, Sumaya Dhari Awad
Published: (2021)

A Survey on Audio-Video Based Defect Detection Through Deep Learning in Railway Maintenance
by: Lorenzo De Donato, et al.
Published: (2022-01-01)

Deep Learning Architecture Reduction for fMRI Data
by: Ruben Alvarez-Gonzalez, et al.
Published: (2022-02-01)

Using Deep Neural Networks for Human Fall Detection Based on Pose Estimation
by: Mohammadamin Salimi, et al.
Published: (2022-06-01)

Reinforcement learning for rule selection in end to end differentiable proving
by: Morris, M
Published: (2021)

On the Use of Deep Reinforcement Learning for Visual Tracking: A Survey
by: Giorgio Cruciata, et al.
Published: (2021-01-01)

Deep learning in computer vision: A critical review of emerging techniques and application scenarios
by: Junyi Chai, et al.
Published: (2021-12-01)

Corrigendum: Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks
by: Jordan R. Ubbens, et al.
Published: (2018-01-01)

Automatic plant phenotyping analysis of Melon (Cucumis melo L.) germplasm resources using deep learning methods and computer vision
by: Shan Xu, et al.
Published: (2024-10-01)

Looking deep at people: towards understanding and generating humans in images with deep learning
by: de Bem, RA
Published: (2018)