End-to-end learning, and audio-visual human-centric video understanding

End-to-end learning, and audio-visual human-centric video understanding

<p>The field of machine learning has seen tremendous progress in the last decade, largely due to the advent of deep neural networks. When trained on large-scale labelled datasets, these machine learning algorithms can learn powerful semantic representations directly from the input data, end-to...

Mô tả đầy đủ

Chi tiết về thư mục
Tác giả chính:	Brown, A
Tác giả khác:	Zisserman, A
Định dạng:	Luận văn
Ngôn ngữ:	English
Được phát hành:	2022
Những chủ đề:	Machine learning Deep learning (Machine learning) Computer vision

Những quyển sách tương tự

Sign language understanding using multimodal learning
Bằng: Momeni, L
Được phát hành: (2024)

Video understanding using multimodal deep learning
Bằng: Nagrani, A
Được phát hành: (2020)

Deep vision for indoor understanding and localisation
Bằng: Howard-Jenkins, H
Được phát hành: (2022)

END TO END LEARNING FOR A DRIVING SIMULATOR
Bằng: V. F. Alexeev, et al.
Được phát hành: (2019-06-01)

Self-Supervised Learning for Audio-Visual Relationships of Videos With Stereo Sounds
Bằng: Tomoya Sato, et al.
Được phát hành: (2022-01-01)

Towards unified visual perception
Bằng: Sun, S
Được phát hành: (2024)

Learning to understand large-scale 3D point clouds
Bằng: Qingyong, H
Được phát hành: (2022)

Understanding video through the lens of language
Bằng: Bain, M
Được phát hành: (2023)

Learning object-centric representations
Bằng: Kosiorek, AR
Được phát hành: (2019)

Self-supervised video representation learning
Bằng: Han, T
Được phát hành: (2022)

Self-supervised and cross-modal learning from videos
Bằng: Koepke, AS
Được phát hành: (2019)

Learning with multimodal self-supervision
Bằng: Chen, H
Được phát hành: (2021)

Learning dense prediction: from correspondence to segmentation
Bằng: Zhang, F
Được phát hành: (2022)

Audio-visual deep learning
Bằng: Afouras, T, et al.
Được phát hành: (2021)

Visual recognition in art using machine learning
Bằng: Crowley, E
Được phát hành: (2017)

Machine Learning and End-to-End Deep Learning for Monitoring Driver Distractions From Physiological and Visual Signals
Bằng: Martin Gjoreski, et al.
Được phát hành: (2020-01-01)

Self-supervised learning using motion and visualizing convolutional neural networks
Bằng: Mahendran, A
Được phát hành: (2018)

Learning Non-Parametric Surrogate Losses With Correlated Gradients
Bằng: Seungdong Yoa, et al.
Được phát hành: (2021-01-01)

Towards diverse generation and reliable classification using neural networks
Bằng: Kulharia, V
Được phát hành: (2022)

Developing object perception in the low data regime
Bằng: Kaul, P
Được phát hành: (2024)

Measuring the security of computer vision systems to adversarial attacks
Bằng: Lovisotto, G
Được phát hành: (2022)

An Interpretable Deep Learning-Based Feature Reduction in Video-Based Human Activity Recognition
Bằng: Micheal Dutt, et al.
Được phát hành: (2024-01-01)

Learning visual concepts with fewer human annotations
Bằng: Ehrhardt, S
Được phát hành: (2020)

Machine learning for audio, image and video analysis : theory and applications /
Bằng: Camastra, Francesco, 1960-, et al.
Được phát hành: (2008)

End-to-End, Pixel-Wise Vessel-Specific Coronary and Aortic Calcium Detection and Scoring Using Deep Learning
Bằng: Gurpreet Singh, et al.
Được phát hành: (2021-02-01)

Fusion of Visual and Audio Signals for Wildlife Surveillance
Bằng: Cheng Hao Ng, et al.
Được phát hành: (2022-11-01)

Unsupervised learning of clutter-resistant visual representations from natural videos
Bằng: Liao, Qianli, et al.
Được phát hành: (2015)

Advancing machine learning in astrophysics
Bằng: Walmsley, M
Được phát hành: (2021)

Robust 2D and 3D registration with deep neural networks
Bằng: Wang, Z
Được phát hành: (2024)

Holistic image understanding with deep learning and dense random fields
Bằng: Zheng, S
Được phát hành: (2016)

End-to-end DVB-S2X system design with deep learning-based channel estimation over satellite fading channels
Bằng: Mfarej, Sumaya Dhari Awad
Được phát hành: (2021)

A Survey on Audio-Video Based Defect Detection Through Deep Learning in Railway Maintenance
Bằng: Lorenzo De Donato, et al.
Được phát hành: (2022-01-01)

Deep Learning Architecture Reduction for fMRI Data
Bằng: Ruben Alvarez-Gonzalez, et al.
Được phát hành: (2022-02-01)

Using Deep Neural Networks for Human Fall Detection Based on Pose Estimation
Bằng: Mohammadamin Salimi, et al.
Được phát hành: (2022-06-01)

Reinforcement learning for rule selection in end to end differentiable proving
Bằng: Morris, M
Được phát hành: (2021)

On the Use of Deep Reinforcement Learning for Visual Tracking: A Survey
Bằng: Giorgio Cruciata, et al.
Được phát hành: (2021-01-01)

Deep learning in computer vision: A critical review of emerging techniques and application scenarios
Bằng: Junyi Chai, et al.
Được phát hành: (2021-12-01)

Corrigendum: Deep Plant Phenomics: A Deep Learning Platform for Complex Plant Phenotyping Tasks
Bằng: Jordan R. Ubbens, et al.
Được phát hành: (2018-01-01)

Automatic plant phenotyping analysis of Melon (Cucumis melo L.) germplasm resources using deep learning methods and computer vision
Bằng: Shan Xu, et al.
Được phát hành: (2024-10-01)

Looking deep at people: towards understanding and generating humans in images with deep learning
Bằng: de Bem, RA
Được phát hành: (2018)