Structured Diffusion Processes in Deep Generative Models

Diffusion generative models have emerged as a powerful, versatile, and elegant generative modeling framework for diverse data modalities. However, the high computational cost of inference relative to other frameworks remains a chief limitation of such models. At the same time, the design space of a...

Full description

Bibliographic Details
Main Author: Jing, Bowen
Other Authors: Jaakkola, Tommi
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/147277
Description
Summary:Diffusion generative models have emerged as a powerful, versatile, and elegant generative modeling framework for diverse data modalities. However, the high computational cost of inference relative to other frameworks remains a chief limitation of such models. At the same time, the design space of a key component in their formulation—the forward diffusion process—has been underexplored. This thesis proposes a paradigm to accelerate and improve diffusion generative models by tailoring structured forward diffusion processes to the generative modeling problem at hand. Case studies of structured diffusion processes are developed and presented for (1) natural images and (2) molecular conformers. First, the subspace structure in images is exploited to develop subspace diffusion, a forward diffusion process that restricts the diffusion via projections to subspaces of decreasing dimensionality. Second, chemical constraints in molecular conformers are exploited to develop torsional diffusion, a forward process that preserves those constraints by operating over a lower-dimensional, non-Euclidean space. Both approaches simultaneously improve sample quality and reduce inference runtime while preserving existing capabilities—and developing new ones—of diffusion generative models.