Randomized graph cluster randomization

The global average treatment effect (GATE) is a primary quantity of interest in the study of causal inference under network interference. With a correctly specified exposure model of the interference, the Horvitz–Thompson (HT) and Hájek estimators of the GATE are unbiased and consistent, respectivel...

Full description

Bibliographic Details
Main Authors: Ugander Johan, Yin Hao
Format: Article
Language:English
Published: De Gruyter 2023-05-01
Series:Journal of Causal Inference
Subjects:
Online Access:https://doi.org/10.1515/jci-2022-0014
_version_ 1797814144498401280
author Ugander Johan
Yin Hao
author_facet Ugander Johan
Yin Hao
author_sort Ugander Johan
collection DOAJ
description The global average treatment effect (GATE) is a primary quantity of interest in the study of causal inference under network interference. With a correctly specified exposure model of the interference, the Horvitz–Thompson (HT) and Hájek estimators of the GATE are unbiased and consistent, respectively, yet known to exhibit extreme variance under many designs and in many settings of interest. With a fixed clustering of the interference graph, graph cluster randomization (GCR) designs have been shown to greatly reduce variance compared to node-level random assignment, but even so the variance is still often prohibitively large. In this work, we propose a randomized version of the GCR design, descriptively named randomized graph cluster randomization (RGCR), which uses a random clustering rather than a single fixed clustering. By considering an ensemble of many different clustering assignments, this design avoids a key problem with GCR where the network exposure probability of a given node can be exponentially small in a single clustering. We propose two inherently randomized graph decomposition algorithms for use with RGCR designs, randomized 3-net and 1-hop-max, adapted from the prior work on multiway graph cut problems and the probabilistic approximation of (graph) metrics. We also propose weighted extensions of these two algorithms with slight additional advantages. All these algorithms result in network exposure probabilities that can be estimated efficiently. We derive structure-dependent upper bounds on the variance of the HT estimator of the GATE, depending on the metric structure of the graph driving the interference. Where the best-known such upper bound for the HT estimator under a GCR design is exponential in the parameters of the metric structure, we give a comparable upper bound under RGCR that is instead polynomial in the same parameters. We provide extensive simulations comparing RGCR and GCR designs, observing substantial improvements in GATE estimation in a variety of settings.
first_indexed 2024-03-13T08:03:13Z
format Article
id doaj.art-6ba80024e7844b95965e18515e889d0b
institution Directory Open Access Journal
issn 2193-3685
language English
last_indexed 2024-03-13T08:03:13Z
publishDate 2023-05-01
publisher De Gruyter
record_format Article
series Journal of Causal Inference
spelling doaj.art-6ba80024e7844b95965e18515e889d0b2023-06-01T09:43:03ZengDe GruyterJournal of Causal Inference2193-36852023-05-011118324210.1515/jci-2022-0014Randomized graph cluster randomizationUgander Johan0Yin Hao1Management Science and Engineering, Stanford University, Stanford, CA 94305, California, United StatesInstitute for Computational and Mathematical Engineering, Stanford University, Stanford, CA 94305, California, United StatesThe global average treatment effect (GATE) is a primary quantity of interest in the study of causal inference under network interference. With a correctly specified exposure model of the interference, the Horvitz–Thompson (HT) and Hájek estimators of the GATE are unbiased and consistent, respectively, yet known to exhibit extreme variance under many designs and in many settings of interest. With a fixed clustering of the interference graph, graph cluster randomization (GCR) designs have been shown to greatly reduce variance compared to node-level random assignment, but even so the variance is still often prohibitively large. In this work, we propose a randomized version of the GCR design, descriptively named randomized graph cluster randomization (RGCR), which uses a random clustering rather than a single fixed clustering. By considering an ensemble of many different clustering assignments, this design avoids a key problem with GCR where the network exposure probability of a given node can be exponentially small in a single clustering. We propose two inherently randomized graph decomposition algorithms for use with RGCR designs, randomized 3-net and 1-hop-max, adapted from the prior work on multiway graph cut problems and the probabilistic approximation of (graph) metrics. We also propose weighted extensions of these two algorithms with slight additional advantages. All these algorithms result in network exposure probabilities that can be estimated efficiently. We derive structure-dependent upper bounds on the variance of the HT estimator of the GATE, depending on the metric structure of the graph driving the interference. Where the best-known such upper bound for the HT estimator under a GCR design is exponential in the parameters of the metric structure, we give a comparable upper bound under RGCR that is instead polynomial in the same parameters. We provide extensive simulations comparing RGCR and GCR designs, observing substantial improvements in GATE estimation in a variety of settings.https://doi.org/10.1515/jci-2022-0014causal inference under interferenceglobal average treatment effectnetwork effectsspilloverssocial networks62d05
spellingShingle Ugander Johan
Yin Hao
Randomized graph cluster randomization
Journal of Causal Inference
causal inference under interference
global average treatment effect
network effects
spillovers
social networks
62d05
title Randomized graph cluster randomization
title_full Randomized graph cluster randomization
title_fullStr Randomized graph cluster randomization
title_full_unstemmed Randomized graph cluster randomization
title_short Randomized graph cluster randomization
title_sort randomized graph cluster randomization
topic causal inference under interference
global average treatment effect
network effects
spillovers
social networks
62d05
url https://doi.org/10.1515/jci-2022-0014
work_keys_str_mv AT uganderjohan randomizedgraphclusterrandomization
AT yinhao randomizedgraphclusterrandomization