The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages

© 2018 The Econometric Society The partial (ceteris paribus) effects of interest in nonlinear and interactive linear models are heterogeneous as they can vary dramatically with the underlying observed or unobserved covariates. Despite the apparent importance of heterogeneity, a common practice in mo...

Full description

Bibliographic Details
Main Authors: Chernozhukov, Victor, Fernández-Val, Iván, Luo, Ye
Other Authors: Massachusetts Institute of Technology. Department of Economics
Format: Article
Language:English
Published: The Econometric Society 2021
Online Access:https://hdl.handle.net/1721.1/134897
_version_ 1811072659801243648
author Chernozhukov, Victor
Fernández-Val, Iván
Luo, Ye
author2 Massachusetts Institute of Technology. Department of Economics
author_facet Massachusetts Institute of Technology. Department of Economics
Chernozhukov, Victor
Fernández-Val, Iván
Luo, Ye
author_sort Chernozhukov, Victor
collection MIT
description © 2018 The Econometric Society The partial (ceteris paribus) effects of interest in nonlinear and interactive linear models are heterogeneous as they can vary dramatically with the underlying observed or unobserved covariates. Despite the apparent importance of heterogeneity, a common practice in modern empirical work is to largely ignore it by reporting average partial effects (or, at best, average effects for some groups). While average effects provide very convenient scalar summaries of typical effects, by definition they fail to reflect the entire variety of the heterogeneous effects. In order to discover these effects much more fully, we propose to estimate and report sorted effects—a collection of estimated partial effects sorted in increasing order and indexed by percentiles. By construction, the sorted effect curves completely represent and help visualize the range of the heterogeneous effects in one plot. They are as convenient and easy to report in practice as the conventional average partial effects. They also serve as a basis for classification analysis, where we divide the observational units into most or least affected groups and summarize their characteristics. We provide a quantification of uncertainty (standard errors and confidence bands) for the estimated sorted effects and related classification analysis, and provide confidence sets for the most and least affected groups. The derived statistical results rely on establishing key, new mathematical results on Hadamard differentiability of a multivariate sorting operator and a related classification operator, which are of independent interest. We apply the sorted effects method and classification analysis to demonstrate several striking patterns in the gender wage gap. We find that this gap is particularly strong for married women, ranging from −60% to 0% between the 2% and 98% percentiles, as a function of observed and unobserved characteristics; while the gap for never married women ranges from −40% to +20%. The most adversely affected women tend to be married, do not have college degrees, work in sales, and have high levels of potential experience.
first_indexed 2024-09-23T09:09:31Z
format Article
id mit-1721.1/134897
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T09:09:31Z
publishDate 2021
publisher The Econometric Society
record_format dspace
spelling mit-1721.1/1348972024-01-02T15:52:16Z The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages Chernozhukov, Victor Fernández-Val, Iván Luo, Ye Massachusetts Institute of Technology. Department of Economics © 2018 The Econometric Society The partial (ceteris paribus) effects of interest in nonlinear and interactive linear models are heterogeneous as they can vary dramatically with the underlying observed or unobserved covariates. Despite the apparent importance of heterogeneity, a common practice in modern empirical work is to largely ignore it by reporting average partial effects (or, at best, average effects for some groups). While average effects provide very convenient scalar summaries of typical effects, by definition they fail to reflect the entire variety of the heterogeneous effects. In order to discover these effects much more fully, we propose to estimate and report sorted effects—a collection of estimated partial effects sorted in increasing order and indexed by percentiles. By construction, the sorted effect curves completely represent and help visualize the range of the heterogeneous effects in one plot. They are as convenient and easy to report in practice as the conventional average partial effects. They also serve as a basis for classification analysis, where we divide the observational units into most or least affected groups and summarize their characteristics. We provide a quantification of uncertainty (standard errors and confidence bands) for the estimated sorted effects and related classification analysis, and provide confidence sets for the most and least affected groups. The derived statistical results rely on establishing key, new mathematical results on Hadamard differentiability of a multivariate sorting operator and a related classification operator, which are of independent interest. We apply the sorted effects method and classification analysis to demonstrate several striking patterns in the gender wage gap. We find that this gap is particularly strong for married women, ranging from −60% to 0% between the 2% and 98% percentiles, as a function of observed and unobserved characteristics; while the gap for never married women ranges from −40% to +20%. The most adversely affected women tend to be married, do not have college degrees, work in sales, and have high levels of potential experience. 2021-10-27T20:09:44Z 2021-10-27T20:09:44Z 2018 2019-10-22T16:08:56Z Article http://purl.org/eprint/type/JournalArticle https://hdl.handle.net/1721.1/134897 en 10.3982/ECTA14415 Econometrica Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf The Econometric Society arXiv
spellingShingle Chernozhukov, Victor
Fernández-Val, Iván
Luo, Ye
The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages
title The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages
title_full The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages
title_fullStr The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages
title_full_unstemmed The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages
title_short The Sorted Effects Method: Discovering Heterogeneous Effects Beyond Their Averages
title_sort sorted effects method discovering heterogeneous effects beyond their averages
url https://hdl.handle.net/1721.1/134897
work_keys_str_mv AT chernozhukovvictor thesortedeffectsmethoddiscoveringheterogeneouseffectsbeyondtheiraverages
AT fernandezvalivan thesortedeffectsmethoddiscoveringheterogeneouseffectsbeyondtheiraverages
AT luoye thesortedeffectsmethoddiscoveringheterogeneouseffectsbeyondtheiraverages
AT chernozhukovvictor sortedeffectsmethoddiscoveringheterogeneouseffectsbeyondtheiraverages
AT fernandezvalivan sortedeffectsmethoddiscoveringheterogeneouseffectsbeyondtheiraverages
AT luoye sortedeffectsmethoddiscoveringheterogeneouseffectsbeyondtheiraverages