Lasso Methods for Gaussian Instrumental Variables Models

In this note, we propose the use of sparse methods (e.g., LASSO, Post-LASSO, p LASSO, and Post-p LASSO) to form first-stage predictions and estimate optimal instruments in linear instrumental variables (IV) models with many instruments in the canonical Gaussian case. The methods apply even when the...

Full description

Bibliographic Details
Main Authors: Belloni, Alexandre, Chernozhukov, Victor, Hansen, Christian
Format: Working Paper
Language:en_US
Published: Cambridge, MA: Department of Economics, Massachusetts Institute of Technology 2011
Online Access:http://hdl.handle.net/1721.1/65142
Description
Summary:In this note, we propose the use of sparse methods (e.g., LASSO, Post-LASSO, p LASSO, and Post-p LASSO) to form first-stage predictions and estimate optimal instruments in linear instrumental variables (IV) models with many instruments in the canonical Gaussian case. The methods apply even when the number of instruments is much larger than the sample size. We derive asymptotic distributions for the resulting IV estimators and provide conditions under which these sparsity-based IV estimators are asymptotically oracle-efficient. In simulation experiments, a sparsity-based IV estimator with a data-driven penalty performs well compared to recently advocated many-instrument-robust procedures. We illustrate the procedure in an empirical example using the Angrist and Krueger (1991) schooling data.