Differentiable visual computing

This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.

Bibliographic Details
Main Author:	Lo, Tzu-Mao
Other Authors:	Frédo Durand.
Format:	Thesis
Language:	eng
Published:	Massachusetts Institute of Technology 2019
Subjects:	Electrical Engineering and Computer Science.
Online Access:	https://hdl.handle.net/1721.1/122486

_version_	1826192831754010624
author	Lo, Tzu-Mao
author2	Frédo Durand.
author_facet	Frédo Durand. Lo, Tzu-Mao
author_sort	Lo, Tzu-Mao
collection	MIT
description	This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
first_indexed	2024-09-23T09:29:32Z
format	Thesis
id	mit-1721.1/122486
institution	Massachusetts Institute of Technology
language	eng
last_indexed	2024-09-23T09:29:32Z
publishDate	2019
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/1224862020-01-16T13:47:52Z Differentiable visual computing Lo, Tzu-Mao Frédo Durand. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Electrical Engineering and Computer Science. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2019 Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 131-148). Derivatives of computer graphics, image processing, and deep learning algorithms have tremendous use in guiding parameter space searches, or solving inverse problems. As the algorithms become more sophisticated, we no longer only need to differentiate simple mathematical functions, but have to deal with general programs which encode complex transformations of data. This dissertation introduces three tools, for addressing the challenges that arise when obtaining and applying the derivatives for complex graphics algorithms. Traditionally, practitioners have been constrained to composing programs with a limited set of coarse-grained operators, or hand-deriving derivatives. We extend the image processing language Halide with reverse-mode automatic differentiation, and the ability to automatically optimize the gradient computations. This enables automatic generation of the gradients of arbitrary Halide programs, at high performance, with little programmer effort. We demonstrate several applications, including how our system enables quality improvements of even traditional, feed-forward image processing algorithms, blurring the distinction between classical and deep learning methods. In 3D rendering, the gradient is required with respect to variables such as camera parameters, light sources, geometry, and appearance. However, computing the gradient is challenging because the rendering integral includes visibility terms that are not differentiable. We introduce, to our knowledge, the first general-purpose differentiable ray tracer that solves the full rendering equation, while correctly taking the geometric discontinuities into account. We show prototype applications in inverse rendering and the generation of adversarial examples for neural networks. Finally, we demonstrate that the derivatives of light path throughput, especially the second-order ones, can also be useful for guiding sampling in forward rendering. Simulating light transport in the presence of multi-bounce glossy effects and motion in 3D rendering is challenging due to the high-dimensional integrand and narrow high-contribution areas. We extend the Metropolis Light Transport algorithm by adapting to the local shape of the integrand, thereby increasing sampling efficiency. In particular, the Hessian is able to capture the strong anisotropy of the integrand. We use ideas from Hamiltonian Monte Carlo and simulate physics in Taylor expansion to draw samples from high-contribution region. by Tzu-Mao Lo. Ph. D. Ph.D. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science 2019-10-11T21:14:38Z 2019-10-11T21:14:38Z 2019 2019 Thesis https://hdl.handle.net/1721.1/122486 1122780012 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 148 pages application/pdf Massachusetts Institute of Technology
spellingShingle	Electrical Engineering and Computer Science. Lo, Tzu-Mao Differentiable visual computing
title	Differentiable visual computing
title_full	Differentiable visual computing
title_fullStr	Differentiable visual computing
title_full_unstemmed	Differentiable visual computing
title_short	Differentiable visual computing
title_sort	differentiable visual computing
topic	Electrical Engineering and Computer Science.
url	https://hdl.handle.net/1721.1/122486
work_keys_str_mv	AT lotzumao differentiablevisualcomputing

Differentiable visual computing

Similar Items