Combinatorial Methods in Statistics
This thesis explores combinatorial methods in random vector balancing, nonparametric estimation, and network inference. First, motivated by problems from controlled experiments, we study random vector balancing from the perspective of discrepancy theory, a classical topic in combinatorics, and give...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2022
|
Online Access: | https://hdl.handle.net/1721.1/139383 |
_version_ | 1826207828621131776 |
---|---|
author | Turner, Paxton Mark |
author2 | Philippe Rigollet |
author_facet | Philippe Rigollet Turner, Paxton Mark |
author_sort | Turner, Paxton Mark |
collection | MIT |
description | This thesis explores combinatorial methods in random vector balancing, nonparametric estimation, and network inference. First, motivated by problems from controlled experiments, we study random vector balancing from the perspective of discrepancy theory, a classical topic in combinatorics, and give sharp statistical results along with improved algorithmic guarantees. Next, we focus on the problem of density estimation and investigate the fundamental statistical limits of coresets, a popular framework for obtaining algorithmic speedups by replacing a large dataset with a representative subset. In the following chapter, motivated by the problem of fast evaluation of kernel density estimators, we demonstrate how a multivariate interpolation scheme from finite-element theory based on the combinatorial-geometric properties of a certain mesh can be used to significantly improve the storage and query time of a nonparametric estimator while also preserving its accuracy. Our final chapter focuses on pedigree reconstruction, a combinatorial inference task of recovering the latent network of familial relationships of a population from its extant genetic data. |
first_indexed | 2024-09-23T13:55:36Z |
format | Thesis |
id | mit-1721.1/139383 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T13:55:36Z |
publishDate | 2022 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1393832022-01-15T03:23:43Z Combinatorial Methods in Statistics Turner, Paxton Mark Philippe Rigollet Massachusetts Institute of Technology. Department of Mathematics This thesis explores combinatorial methods in random vector balancing, nonparametric estimation, and network inference. First, motivated by problems from controlled experiments, we study random vector balancing from the perspective of discrepancy theory, a classical topic in combinatorics, and give sharp statistical results along with improved algorithmic guarantees. Next, we focus on the problem of density estimation and investigate the fundamental statistical limits of coresets, a popular framework for obtaining algorithmic speedups by replacing a large dataset with a representative subset. In the following chapter, motivated by the problem of fast evaluation of kernel density estimators, we demonstrate how a multivariate interpolation scheme from finite-element theory based on the combinatorial-geometric properties of a certain mesh can be used to significantly improve the storage and query time of a nonparametric estimator while also preserving its accuracy. Our final chapter focuses on pedigree reconstruction, a combinatorial inference task of recovering the latent network of familial relationships of a population from its extant genetic data. Ph.D. 2022-01-14T15:08:16Z 2022-01-14T15:08:16Z 2021-06 2021-05-25T12:47:41.836Z Thesis https://hdl.handle.net/1721.1/139383 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology |
spellingShingle | Turner, Paxton Mark Combinatorial Methods in Statistics |
title | Combinatorial Methods in Statistics |
title_full | Combinatorial Methods in Statistics |
title_fullStr | Combinatorial Methods in Statistics |
title_full_unstemmed | Combinatorial Methods in Statistics |
title_short | Combinatorial Methods in Statistics |
title_sort | combinatorial methods in statistics |
url | https://hdl.handle.net/1721.1/139383 |
work_keys_str_mv | AT turnerpaxtonmark combinatorialmethodsinstatistics |