Application-driven Intersections between Information Theory and Machine Learning

Machine learning has been tremendously successful in the past decade. In this thesis, we introduce guidance and insights from information theory to practical machine learning algorithms. In particular, we study three application domains and demonstrate the algorithmic gain of integrating machine lea...

Volledige beschrijving

Bibliografische gegevens
Hoofdauteur:	Liu, Litian
Andere auteurs:	Médard, Muriel
Formaat:	Thesis
Gepubliceerd in:	Massachusetts Institute of Technology 2022
Online toegang:	https://hdl.handle.net/1721.1/139138

Omschrijving
Samenvatting:	Machine learning has been tremendously successful in the past decade. In this thesis, we introduce guidance and insights from information theory to practical machine learning algorithms. In particular, we study three application domains and demonstrate the algorithmic gain of integrating machine learning with information theory. In the first part of the thesis, we deploy the principle of network coding to propose a decomposition scheme for distributing a neural network over a physical communication network. We show through experiments that our proposed scheme dramatically reduces the energy used compared to existing communication schemes under various channel statistics and network topologies. In the second part, we design a learning-based coding scheme, developed from the concept of error correction codes, for bio-molecular profiling. We show through simulations that, with a learning-based encoder and a maximize a posterior (MAP) decoder, our scheme significantly outperforms existing schemes in reducing the false negative rate of rare bio-molecular types. In the third part, we exercise guesswork on the machine translation problem. We study machine translation using the seq2seq model and we provide insights into quantifying the uncertainty within. Our results shed light on the design of inference in machine translation for selecting the beam size in beam search.

Application-driven Intersections between Information Theory and Machine Learning

Gelijkaardige items