The Balance-Sample Size Frontier in Matching Methods for Causal Inference

We propose a simplified approach to matching for causal inference that simultaneously optimizes balance (similarity between the treated and control groups) and matched sample size. Existing approaches either fix the matched sample size and maximize balance or fix balance and maximize sample size, le...

Full description

Bibliographic Details
Main Authors: King, Gary, Lucas, Christopher, Nielsen, Richard Alexander
Other Authors: Massachusetts Institute of Technology. Department of Political Science
Format: Article
Published: Wiley 2018
Online Access:http://hdl.handle.net/1721.1/118640
https://orcid.org/0000-0003-0205-5227
Description
Summary:We propose a simplified approach to matching for causal inference that simultaneously optimizes balance (similarity between the treated and control groups) and matched sample size. Existing approaches either fix the matched sample size and maximize balance or fix balance and maximize sample size, leaving analysts to settle for suboptimal solutions or attempt manual optimization by iteratively tweaking their matching method and rechecking balance. To jointly maximize balance and sample size, we introduce the matching frontier, the set of matching solutions with maximum possible balance for each sample size. Rather than iterating, researchers can choose matching solutions from the frontier for analysis in one step. We derive fast algorithms that calculate the matching frontier for several commonly used balance metrics. We demonstrate this approach with analyses of the effect of sex on judging and job training programs that show how the methods we introduce can extract new knowledge from existing data sets.