Combinatorial Methods in Statistics

This thesis explores combinatorial methods in random vector balancing, nonparametric estimation, and network inference. First, motivated by problems from controlled experiments, we study random vector balancing from the perspective of discrepancy theory, a classical topic in combinatorics, and give...

Full description

Bibliographic Details
Main Author: Turner, Paxton Mark
Other Authors: Philippe Rigollet
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access:https://hdl.handle.net/1721.1/139383
_version_ 1826207828621131776
author Turner, Paxton Mark
author2 Philippe Rigollet
author_facet Philippe Rigollet
Turner, Paxton Mark
author_sort Turner, Paxton Mark
collection MIT
description This thesis explores combinatorial methods in random vector balancing, nonparametric estimation, and network inference. First, motivated by problems from controlled experiments, we study random vector balancing from the perspective of discrepancy theory, a classical topic in combinatorics, and give sharp statistical results along with improved algorithmic guarantees. Next, we focus on the problem of density estimation and investigate the fundamental statistical limits of coresets, a popular framework for obtaining algorithmic speedups by replacing a large dataset with a representative subset. In the following chapter, motivated by the problem of fast evaluation of kernel density estimators, we demonstrate how a multivariate interpolation scheme from finite-element theory based on the combinatorial-geometric properties of a certain mesh can be used to significantly improve the storage and query time of a nonparametric estimator while also preserving its accuracy. Our final chapter focuses on pedigree reconstruction, a combinatorial inference task of recovering the latent network of familial relationships of a population from its extant genetic data.
first_indexed 2024-09-23T13:55:36Z
format Thesis
id mit-1721.1/139383
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T13:55:36Z
publishDate 2022
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1393832022-01-15T03:23:43Z Combinatorial Methods in Statistics Turner, Paxton Mark Philippe Rigollet Massachusetts Institute of Technology. Department of Mathematics This thesis explores combinatorial methods in random vector balancing, nonparametric estimation, and network inference. First, motivated by problems from controlled experiments, we study random vector balancing from the perspective of discrepancy theory, a classical topic in combinatorics, and give sharp statistical results along with improved algorithmic guarantees. Next, we focus on the problem of density estimation and investigate the fundamental statistical limits of coresets, a popular framework for obtaining algorithmic speedups by replacing a large dataset with a representative subset. In the following chapter, motivated by the problem of fast evaluation of kernel density estimators, we demonstrate how a multivariate interpolation scheme from finite-element theory based on the combinatorial-geometric properties of a certain mesh can be used to significantly improve the storage and query time of a nonparametric estimator while also preserving its accuracy. Our final chapter focuses on pedigree reconstruction, a combinatorial inference task of recovering the latent network of familial relationships of a population from its extant genetic data. Ph.D. 2022-01-14T15:08:16Z 2022-01-14T15:08:16Z 2021-06 2021-05-25T12:47:41.836Z Thesis https://hdl.handle.net/1721.1/139383 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Turner, Paxton Mark
Combinatorial Methods in Statistics
title Combinatorial Methods in Statistics
title_full Combinatorial Methods in Statistics
title_fullStr Combinatorial Methods in Statistics
title_full_unstemmed Combinatorial Methods in Statistics
title_short Combinatorial Methods in Statistics
title_sort combinatorial methods in statistics
url https://hdl.handle.net/1721.1/139383
work_keys_str_mv AT turnerpaxtonmark combinatorialmethodsinstatistics