Matching with semi-bandits

We consider an experimental setting in which a matching of resources to participants has to be chosen repeatedly and returns from the individual chosen matches are unknown, but can be learned. Our setting covers two-sided and one-sided matching with (potentially complex) capacity constraints, such a...

Full description

Bibliographic Details
Main Authors:	Kasy, M, Teytelboym, A
Format:	Journal article
Language:	English
Published:	Oxford University Press 2022

_version_	1797109699891429376
author	Kasy, M Teytelboym, A
author_facet	Kasy, M Teytelboym, A
author_sort	Kasy, M
collection	OXFORD
description	We consider an experimental setting in which a matching of resources to participants has to be chosen repeatedly and returns from the individual chosen matches are unknown, but can be learned. Our setting covers two-sided and one-sided matching with (potentially complex) capacity constraints, such as refugee resettlement, social housing allocation, and foster care. We propose a variant of the Thompson sampling algorithm to solve such adaptive combinatorial allocation problems. We give a tight, prior-independent, finite-sample bound on the expected regret for this algorithm. Although the number of allocations grows exponentially in the number of matches, our bound does not. In simulations based on refugee resettlement data using a Bayesian hierarchical model, we find that the algorithm achieves half of the employment gains (relative to the status quo) that could be obtained in an optimal matching based on perfect knowledge of employment probabilities.
first_indexed	2024-03-07T07:43:39Z
format	Journal article
id	oxford-uuid:41bf0ee4-654d-46b7-9e6c-838dabaaac78
institution	University of Oxford
language	English
last_indexed	2024-03-07T07:43:39Z
publishDate	2022
publisher	Oxford University Press
record_format	dspace
spelling	oxford-uuid:41bf0ee4-654d-46b7-9e6c-838dabaaac782023-05-25T08:56:33ZMatching with semi-banditsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:41bf0ee4-654d-46b7-9e6c-838dabaaac78EnglishSymplectic ElementsOxford University Press2022Kasy, MTeytelboym, AWe consider an experimental setting in which a matching of resources to participants has to be chosen repeatedly and returns from the individual chosen matches are unknown, but can be learned. Our setting covers two-sided and one-sided matching with (potentially complex) capacity constraints, such as refugee resettlement, social housing allocation, and foster care. We propose a variant of the Thompson sampling algorithm to solve such adaptive combinatorial allocation problems. We give a tight, prior-independent, finite-sample bound on the expected regret for this algorithm. Although the number of allocations grows exponentially in the number of matches, our bound does not. In simulations based on refugee resettlement data using a Bayesian hierarchical model, we find that the algorithm achieves half of the employment gains (relative to the status quo) that could be obtained in an optimal matching based on perfect knowledge of employment probabilities.
spellingShingle	Kasy, M Teytelboym, A Matching with semi-bandits
title	Matching with semi-bandits
title_full	Matching with semi-bandits
title_fullStr	Matching with semi-bandits
title_full_unstemmed	Matching with semi-bandits
title_short	Matching with semi-bandits
title_sort	matching with semi bandits
work_keys_str_mv	AT kasym matchingwithsemibandits AT teytelboyma matchingwithsemibandits

Matching with semi-bandits

Similar Items