End-to-end learning, and audio-visual human-centric video understanding

End-to-end learning, and audio-visual human-centric video understanding

<p>The field of machine learning has seen tremendous progress in the last decade, largely due to the advent of deep neural networks. When trained on large-scale labelled datasets, these machine learning algorithms can learn powerful semantic representations directly from the input data, end-to...

وصف كامل

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Brown, A
مؤلفون آخرون:	Zisserman, A
التنسيق:	أطروحة
اللغة:	English
منشور في:	2022
الموضوعات:	Machine learning Deep learning (Machine learning) Computer vision

مواد مشابهة

Sign language understanding using multimodal learning
حسب: Momeni, L
منشور في: (2024)

Video understanding using multimodal deep learning
حسب: Nagrani, A
منشور في: (2020)

Deep vision for indoor understanding and localisation
حسب: Howard-Jenkins, H
منشور في: (2022)

END TO END LEARNING FOR A DRIVING SIMULATOR
حسب: V. F. Alexeev, وآخرون
منشور في: (2019-06-01)

Self-Supervised Learning for Audio-Visual Relationships of Videos With Stereo Sounds
حسب: Tomoya Sato, وآخرون
منشور في: (2022-01-01)

Towards unified visual perception
حسب: Sun, S
منشور في: (2024)

Learning to understand large-scale 3D point clouds
حسب: Qingyong, H
منشور في: (2022)

Understanding video through the lens of language
حسب: Bain, M
منشور في: (2023)

Learning object-centric representations
حسب: Kosiorek, AR
منشور في: (2019)

Self-supervised video representation learning
حسب: Han, T
منشور في: (2022)

Self-supervised and cross-modal learning from videos
حسب: Koepke, AS
منشور في: (2019)

Learning with multimodal self-supervision
حسب: Chen, H
منشور في: (2021)

Learning dense prediction: from correspondence to segmentation
حسب: Zhang, F
منشور في: (2022)

Audio-visual deep learning
حسب: Afouras, T, وآخرون
منشور في: (2021)

Visual recognition in art using machine learning
حسب: Crowley, E
منشور في: (2017)

Machine Learning and End-to-End Deep Learning for Monitoring Driver Distractions From Physiological and Visual Signals
حسب: Martin Gjoreski, وآخرون
منشور في: (2020-01-01)

Self-supervised learning using motion and visualizing convolutional neural networks
حسب: Mahendran, A
منشور في: (2018)

Learning Non-Parametric Surrogate Losses With Correlated Gradients
حسب: Seungdong Yoa, وآخرون
منشور في: (2021-01-01)

Towards diverse generation and reliable classification using neural networks
حسب: Kulharia, V
منشور في: (2022)

Developing object perception in the low data regime
حسب: Kaul, P
منشور في: (2024)

Measuring the security of computer vision systems to adversarial attacks
حسب: Lovisotto, G
منشور في: (2022)

An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
حسب: Micheal Dutt, وآخرون
منشور في: (2024-01-01)

Learning visual concepts with fewer human annotations
حسب: Ehrhardt, S
منشور في: (2020)

Machine learning for audio, image and video analysis : theory and applications /
حسب: Camastra, Francesco, 1960-, وآخرون
منشور في: (2008)

End-to-End, Pixel-Wise Vessel-Specific Coronary and Aortic Calcium Detection and Scoring Using Deep Learning
حسب: Gurpreet Singh, وآخرون
منشور في: (2021-02-01)

Fusion of Visual and Audio Signals for Wildlife Surveillance
حسب: Cheng Hao Ng, وآخرون
منشور في: (2022-11-01)

Unsupervised learning of clutter-resistant visual representations from natural videos
حسب: Liao, Qianli, وآخرون
منشور في: (2015)

Advancing machine learning in astrophysics
حسب: Walmsley, M
منشور في: (2021)

Robust 2D and 3D registration with deep neural networks
حسب: Wang, Z
منشور في: (2024)

Holistic image understanding with deep learning and dense random fields
حسب: Zheng, S
منشور في: (2016)

End-to-end DVB-S2X system design with deep learning-based channel estimation over satellite fading channels
حسب: Mfarej, Sumaya Dhari Awad
منشور في: (2021)

A Survey on Audio-Video Based Defect Detection Through Deep Learning in Railway Maintenance
حسب: Lorenzo De Donato, وآخرون
منشور في: (2022-01-01)

Deep Learning Architecture Reduction for fMRI Data
حسب: Ruben Alvarez-Gonzalez, وآخرون
منشور في: (2022-02-01)

Using Deep Neural Networks for Human Fall Detection Based on Pose Estimation
حسب: Mohammadamin Salimi, وآخرون
منشور في: (2022-06-01)

Reinforcement learning for rule selection in end to end differentiable proving
حسب: Morris, M
منشور في: (2021)

On the Use of Deep Reinforcement Learning for Visual Tracking: A Survey
حسب: Giorgio Cruciata, وآخرون
منشور في: (2021-01-01)

Deep learning in computer vision: A critical review of emerging techniques and application scenarios
حسب: Junyi Chai, وآخرون
منشور في: (2021-12-01)

Corrigendum: Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks
حسب: Jordan R. Ubbens, وآخرون
منشور في: (2018-01-01)

Automatic plant phenotyping analysis of Melon (Cucumis melo L.) germplasm resources using deep learning methods and computer vision
حسب: Shan Xu, وآخرون
منشور في: (2024-10-01)

Looking deep at people: towards understanding and generating humans in images with deep learning
حسب: de Bem, RA
منشور في: (2018)