Machine learning of image analysis with convolutional networks and topological constraints

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2010.

Bibliographic Details
Main Author: Jain, Viren
Other Authors: H. Sebastian Seung.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2010
Subjects:
Online Access:http://hdl.handle.net/1721.1/57546
_version_ 1826203827601145856
author Jain, Viren
author2 H. Sebastian Seung.
author_facet H. Sebastian Seung.
Jain, Viren
author_sort Jain, Viren
collection MIT
description Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2010.
first_indexed 2024-09-23T12:43:50Z
format Thesis
id mit-1721.1/57546
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T12:43:50Z
publishDate 2010
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/575462019-04-12T15:19:12Z Machine learning of image analysis with convolutional networks and topological constraints Jain, Viren H. Sebastian Seung. Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences. Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences. Brain and Cognitive Sciences. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2010. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student submitted PDF version of thesis. Includes bibliographical references (p. 130-140). We present an approach to solving computer vision problems in which the goal is to produce a high-dimensional, pixel-based interpretation of some aspect of the underlying structure of an image. Such tasks have traditionally been categorized as ''low-level vision'' problems, and examples include image denoising, boundary detection, and motion estimation. Our approach is characterized by two main elements, both of which represent a departure from previous work. The first is a focus on convolutional networks, a machine learning strategy that operates directly on an input image with no use of hand-designed features and employs many thousands of free parameters that are learned from data. Previous work in low-level vision has been largely focused on completely hand-designed algorithms or learning methods with a hand-designed feature space. We demonstrate that a learning approach with high model complexity, but zero prior knowledge about any specific image domain, can outperform existing techniques even in the challenging area of natural image processing. We also present results that establish how convolutional networks are closely related to Markov random fields (MRFs), a popular probabilistic approach to image analysis, but can in practice can achieve significantly greater model complexity. The second aspect of our approach is the use of domain specific cost functions and learning algorithms that reflect the structured nature of certain prediction problems in image analysis. (cont.) In particular, we show how concepts from digital topology can be used in the context of boundary detection to both evaluate and optimize the high-order property of topological accuracy. We demonstrate that these techniques can significantly improve the machine learning approach and outperform state of the art boundary detection and segmentation methods. Throughout our work we maintain a special interest and focus on application of our methods to connectomics, an emerging scientific discipline that seeks high-throughput methods for recovering neural connectivity data from brains. This application requires solving low-level image analysis problems on a tera-voxel or peta-voxel scale, and therefore represents an extremely challenging and exciting arena for the development of computer vision methods. by Viren Jain. Ph.D. 2010-08-26T15:22:21Z 2010-08-26T15:22:21Z 2010 2010 Thesis http://hdl.handle.net/1721.1/57546 639292432 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 140 p. application/pdf Massachusetts Institute of Technology
spellingShingle Brain and Cognitive Sciences.
Jain, Viren
Machine learning of image analysis with convolutional networks and topological constraints
title Machine learning of image analysis with convolutional networks and topological constraints
title_full Machine learning of image analysis with convolutional networks and topological constraints
title_fullStr Machine learning of image analysis with convolutional networks and topological constraints
title_full_unstemmed Machine learning of image analysis with convolutional networks and topological constraints
title_short Machine learning of image analysis with convolutional networks and topological constraints
title_sort machine learning of image analysis with convolutional networks and topological constraints
topic Brain and Cognitive Sciences.
url http://hdl.handle.net/1721.1/57546
work_keys_str_mv AT jainviren machinelearningofimageanalysiswithconvolutionalnetworksandtopologicalconstraints