Recognizing describable attributes of textures and materials in the wild and clutter

<p>Visual textures play an important role in image understanding because theyare a key component of the semantic of many images. Furthermore, texture representations, which pool local image descriptors in an orderless manner, have hada tremendous impact in a wide range of computer vision probl...

Mô tả đầy đủ

Chi tiết về thư mục
Tác giả chính: Cimpoi, M
Tác giả khác: Vedaldi, A
Định dạng: Luận văn
Ngôn ngữ:English
Được phát hành: 2015
Những chủ đề:
_version_ 1826281696443498496
author Cimpoi, M
author2 Vedaldi, A
author_facet Vedaldi, A
Cimpoi, M
author_sort Cimpoi, M
collection OXFORD
description <p>Visual textures play an important role in image understanding because theyare a key component of the semantic of many images. Furthermore, texture representations, which pool local image descriptors in an orderless manner, have hada tremendous impact in a wide range of computer vision problems, from texture recognition to object detection. In this thesis we make several contributions to the area of texture understanding.</p> <p>First, we add a new semantic dimension to texture recognition. Instead of focusing on instance or material recognition, we propose a human-interpretable vocabulary of texture attributes, inspired from studies in Cognitive Science, to describe common texture patterns. We also develop a corresponding dataset, the <em>Describable Texture Dataset</em> (DTD), for benchmarking. We show that these texture attributes produce intuitive descriptions of textures. We also show that they can be used to extract a very low dimensional representation of any texture that is very effective in other texture analysis tasks, including improving the state-of-the art in material recognition on the most challenging datasets available today.</p> <p>Second, we look at the problem of recognizing texture attributes and materials in realistic uncontrolled imaging conditions, including when textures appear in clutter. We build on top of the recently proposed Open Surfaces dataset, introduced by the graphics community, by deriving a corresponding benchmarks for material recognition. In addition to material labels, we also augment a subset of Open Surfaces with semantic attributes.</p> <p>Third, we propose a novel texture representation, combining the recent advances in deep-learning with the power of Fisher Vector pooling. We provide thorough evaluation of the new representation, and revisit in general classic texture representations, including bag-of-visual-words, VLAD and the Fisher Vectors, in the context of deep learning. We show that these pooling mechanisms have excellent efficiency and generalisation properties if the convolutional layers of a deep model are used as local features. We obtain in this manner state-of-the-art performance in numerous datasets, both in texture recognition and image understanding in general. We show through our experiments that the proposed representation is an efficient way to apply deep features to image regions, and that it is an effective manner of transferring deep features from one domain to another.</p>
first_indexed 2024-03-07T00:32:41Z
format Thesis
id oxford-uuid:805cb25c-61b4-4c84-9abf-c82ea2b64495
institution University of Oxford
language English
last_indexed 2024-03-07T00:32:41Z
publishDate 2015
record_format dspace
spelling oxford-uuid:805cb25c-61b4-4c84-9abf-c82ea2b644952022-03-26T21:22:46ZRecognizing describable attributes of textures and materials in the wild and clutterThesishttp://purl.org/coar/resource_type/c_db06uuid:805cb25c-61b4-4c84-9abf-c82ea2b64495Information engineeringImage understandingEnglishOxford University Research Archive - Valet2015Cimpoi, MVedaldi, A<p>Visual textures play an important role in image understanding because theyare a key component of the semantic of many images. Furthermore, texture representations, which pool local image descriptors in an orderless manner, have hada tremendous impact in a wide range of computer vision problems, from texture recognition to object detection. In this thesis we make several contributions to the area of texture understanding.</p> <p>First, we add a new semantic dimension to texture recognition. Instead of focusing on instance or material recognition, we propose a human-interpretable vocabulary of texture attributes, inspired from studies in Cognitive Science, to describe common texture patterns. We also develop a corresponding dataset, the <em>Describable Texture Dataset</em> (DTD), for benchmarking. We show that these texture attributes produce intuitive descriptions of textures. We also show that they can be used to extract a very low dimensional representation of any texture that is very effective in other texture analysis tasks, including improving the state-of-the art in material recognition on the most challenging datasets available today.</p> <p>Second, we look at the problem of recognizing texture attributes and materials in realistic uncontrolled imaging conditions, including when textures appear in clutter. We build on top of the recently proposed Open Surfaces dataset, introduced by the graphics community, by deriving a corresponding benchmarks for material recognition. In addition to material labels, we also augment a subset of Open Surfaces with semantic attributes.</p> <p>Third, we propose a novel texture representation, combining the recent advances in deep-learning with the power of Fisher Vector pooling. We provide thorough evaluation of the new representation, and revisit in general classic texture representations, including bag-of-visual-words, VLAD and the Fisher Vectors, in the context of deep learning. We show that these pooling mechanisms have excellent efficiency and generalisation properties if the convolutional layers of a deep model are used as local features. We obtain in this manner state-of-the-art performance in numerous datasets, both in texture recognition and image understanding in general. We show through our experiments that the proposed representation is an efficient way to apply deep features to image regions, and that it is an effective manner of transferring deep features from one domain to another.</p>
spellingShingle Information engineering
Image understanding
Cimpoi, M
Recognizing describable attributes of textures and materials in the wild and clutter
title Recognizing describable attributes of textures and materials in the wild and clutter
title_full Recognizing describable attributes of textures and materials in the wild and clutter
title_fullStr Recognizing describable attributes of textures and materials in the wild and clutter
title_full_unstemmed Recognizing describable attributes of textures and materials in the wild and clutter
title_short Recognizing describable attributes of textures and materials in the wild and clutter
title_sort recognizing describable attributes of textures and materials in the wild and clutter
topic Information engineering
Image understanding
work_keys_str_mv AT cimpoim recognizingdescribableattributesoftexturesandmaterialsinthewildandclutter