Holistic image understanding with deep learning and dense random fields

<p>One aim of holistic image understanding is not only to recognise the things and stuff in images but also to localise where they are exactly. Semantic image segmentation is set up to achieve this goal. The purpose of this task is to recognise and delineate the visual objects. The solution to...

Celý popis

Podrobná bibliografie
Hlavní autor:	Zheng, S
Další autoři:	Torr, P
Médium:	Diplomová práce
Jazyk:	English
Vydáno:	2016
Témata:	Computer science Machine Learning Computer Vision

_version_	1826315844806770688
author	Zheng, S
author2	Torr, P
author_facet	Torr, P Zheng, S
author_sort	Zheng, S
collection	OXFORD
description	<p>One aim of holistic image understanding is not only to recognise the things and stuff in images but also to localise where they are exactly. Semantic image segmentation is set up to achieve this goal. The purpose of this task is to recognise and delineate the visual objects. The solution to this task provides detailed information to understand images and is central to applications such as content-based image search, autonomous vehicles, image-editing, and smart glasses for partially-sighted people. This task is challenging to address not only because the visual objects from the same category could have a variety of appearances but also because of the need to account for contextual information across images such as edges and appearance consistency. The objective of this thesis is to bridge the gap between the pixel-based image representation in computer devices and the meaning extracted by humans.</p> <p>Our primary contributions are fourfold. Firstly, we propose a factorial fully-connected conditional random field that addresses the problem of jointly estimating the segmentation for both object class and visual attributes. Secondly, we embed the proposed factorial fully-connected conditional random fields framework in an interactive image segmentation system. This system allows users to refine the semantic image segmentation with verbal instructions. Thirdly, we formulate filter-based mean-field approximate inference for fully-connected conditional random fields with Gaussian pairwise potentials as a recurrent neural network. This formulation allows us to integrate fully convolutional neural networks and conditional random fields in an end-to-end trainable system. Fourthly, we show the relationship between fully-connected conditional random fields with Gaussian pairwise potentials and iterative Graph-cut: We found that fully-connected conditional random fields with Gaussian Pairwise potential implicitly model the unnormalised global colour models for foreground and background.</p>
first_indexed	2024-03-06T19:33:11Z
format	Thesis
id	oxford-uuid:1e29bfc5-16ed-44fa-a75c-40d215035fee
institution	University of Oxford
language	English
last_indexed	2024-12-09T03:33:37Z
publishDate	2016
record_format	dspace
spelling	oxford-uuid:1e29bfc5-16ed-44fa-a75c-40d215035fee2024-12-01T16:49:19ZHolistic image understanding with deep learning and dense random fieldsThesishttp://purl.org/coar/resource_type/c_db06uuid:1e29bfc5-16ed-44fa-a75c-40d215035feeComputer scienceMachine LearningComputer VisionEnglishORA Deposit2016Zheng, STorr, P<p>One aim of holistic image understanding is not only to recognise the things and stuff in images but also to localise where they are exactly. Semantic image segmentation is set up to achieve this goal. The purpose of this task is to recognise and delineate the visual objects. The solution to this task provides detailed information to understand images and is central to applications such as content-based image search, autonomous vehicles, image-editing, and smart glasses for partially-sighted people. This task is challenging to address not only because the visual objects from the same category could have a variety of appearances but also because of the need to account for contextual information across images such as edges and appearance consistency. The objective of this thesis is to bridge the gap between the pixel-based image representation in computer devices and the meaning extracted by humans.</p> <p>Our primary contributions are fourfold. Firstly, we propose a factorial fully-connected conditional random field that addresses the problem of jointly estimating the segmentation for both object class and visual attributes. Secondly, we embed the proposed factorial fully-connected conditional random fields framework in an interactive image segmentation system. This system allows users to refine the semantic image segmentation with verbal instructions. Thirdly, we formulate filter-based mean-field approximate inference for fully-connected conditional random fields with Gaussian pairwise potentials as a recurrent neural network. This formulation allows us to integrate fully convolutional neural networks and conditional random fields in an end-to-end trainable system. Fourthly, we show the relationship between fully-connected conditional random fields with Gaussian pairwise potentials and iterative Graph-cut: We found that fully-connected conditional random fields with Gaussian Pairwise potential implicitly model the unnormalised global colour models for foreground and background.</p>
spellingShingle	Computer science Machine Learning Computer Vision Zheng, S Holistic image understanding with deep learning and dense random fields
title	Holistic image understanding with deep learning and dense random fields
title_full	Holistic image understanding with deep learning and dense random fields
title_fullStr	Holistic image understanding with deep learning and dense random fields
title_full_unstemmed	Holistic image understanding with deep learning and dense random fields
title_short	Holistic image understanding with deep learning and dense random fields
title_sort	holistic image understanding with deep learning and dense random fields
topic	Computer science Machine Learning Computer Vision
work_keys_str_mv	AT zhengs holisticimageunderstandingwithdeeplearninganddenserandomfields

Holistic image understanding with deep learning and dense random fields

Podobné jednotky