Matching with semi-bandits
We consider an experimental setting in which a matching of resources to participants has to be chosen repeatedly and returns from the individual chosen matches are unknown, but can be learned. Our setting covers two-sided and one-sided matching with (potentially complex) capacity constraints, such a...
Main Authors: | , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
Oxford University Press
2022
|
_version_ | 1797109699891429376 |
---|---|
author | Kasy, M Teytelboym, A |
author_facet | Kasy, M Teytelboym, A |
author_sort | Kasy, M |
collection | OXFORD |
description | We consider an experimental setting in which a matching of resources to participants has to be chosen repeatedly and returns from the individual chosen matches are unknown, but can be learned. Our setting covers two-sided and one-sided matching with (potentially complex) capacity constraints, such as refugee resettlement, social housing allocation, and foster care. We propose a variant of the Thompson sampling algorithm to solve such adaptive combinatorial allocation problems. We give a tight, prior-independent, finite-sample bound on the expected regret for this algorithm. Although the number of allocations grows exponentially in the number of matches, our bound does not. In simulations based on refugee resettlement data using a Bayesian hierarchical model, we find that the algorithm achieves half of the employment gains (relative to the status quo) that could be obtained in an optimal matching based on perfect knowledge of employment probabilities. |
first_indexed | 2024-03-07T07:43:39Z |
format | Journal article |
id | oxford-uuid:41bf0ee4-654d-46b7-9e6c-838dabaaac78 |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-07T07:43:39Z |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | dspace |
spelling | oxford-uuid:41bf0ee4-654d-46b7-9e6c-838dabaaac782023-05-25T08:56:33ZMatching with semi-banditsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:41bf0ee4-654d-46b7-9e6c-838dabaaac78EnglishSymplectic ElementsOxford University Press2022Kasy, MTeytelboym, AWe consider an experimental setting in which a matching of resources to participants has to be chosen repeatedly and returns from the individual chosen matches are unknown, but can be learned. Our setting covers two-sided and one-sided matching with (potentially complex) capacity constraints, such as refugee resettlement, social housing allocation, and foster care. We propose a variant of the Thompson sampling algorithm to solve such adaptive combinatorial allocation problems. We give a tight, prior-independent, finite-sample bound on the expected regret for this algorithm. Although the number of allocations grows exponentially in the number of matches, our bound does not. In simulations based on refugee resettlement data using a Bayesian hierarchical model, we find that the algorithm achieves half of the employment gains (relative to the status quo) that could be obtained in an optimal matching based on perfect knowledge of employment probabilities. |
spellingShingle | Kasy, M Teytelboym, A Matching with semi-bandits |
title | Matching with semi-bandits |
title_full | Matching with semi-bandits |
title_fullStr | Matching with semi-bandits |
title_full_unstemmed | Matching with semi-bandits |
title_short | Matching with semi-bandits |
title_sort | matching with semi bandits |
work_keys_str_mv | AT kasym matchingwithsemibandits AT teytelboyma matchingwithsemibandits |