Counterfactual off-policy evaluation with gumbel-max structural causal models

Counterfactual off-policy evaluation with gumbel-max structural causal models

We introduce an off-policy evaluation procedure for highlighting episodes where applying a reinforcement learned (RL) policy is likely to have produced a substantially different outcome than the observed policy. In particular, we introduce a class of structural causal models (SCMs) for generating co...

Full description

Bibliographic Details
Main Authors:	Oberst, Michael, Sontag, David Alexander
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format:	Article
Language:	English
Published:	MLResearch Press 2021
Online Access:	https://hdl.handle.net/1721.1/130437

Similar Items

Counterfactual policy introspection using structural causal models
by: Oberst, Michael Karl.
Published: (2020)

A counterfactual simulation model of causal judgments for physical events.
by: Gerstenberg, Tobias, et al.
Published: (2021)

Melacak distribusi Gumbel
by: Perpustakaan UGM, i-lib
Published: (1984)

Counterfactual: An R Package for Counterfactual Analysis
by: Chen, Mingli, et al.
Published: (2019)

Analyses of prior selections for Gumbel distribution
by: Rostami, Mohammad, et al.
Published: (2013)

Markov chain Monte Carlo convergence diagnostics for Gumbel model
by: Mohd Amin, Nor Azrita, et al.
Published: (2016)

Counterfactuals and Probability
by: Khoo, Justin
Published: (2022)

Statistical Inference on the Modified Gumbel Distribution Parameters
by: Hurairah, Ahmed Ali Omar
Published: (2006)

Hybrid conditional plot of goodness-of-fit for gumbel distribution (Plot bersyarat hibrid bagi ujian kebagusan penyuaian untuk taburan gumbel)
by: Nahdiya Zainal Abidin,, et al.
Published: (2012)

Hybrid conditional plot of goodness-of-fit for Gumbel distribution
by: Zainal Abidin, Nahdiya, et al.
Published: (2012)

Counterfactual quantum cryptography
by: Henry, Carrin
Published: (2021)

Counterfactual Thoughts in Photography
by: Reinhuber, Elke
Published: (2017)

Inference on counterfactual distributions
by: Chernozhukov, Victor, et al.
Published: (2011)

Inference on counterfactual distributions
by: Chernozhukov, Victor, et al.
Published: (2011)

Inference on Counterfactual Distributions
by: Chernozhukov, Victor V., et al.
Published: (2015)

Backtracking Counterfactuals Revisited
by: Khoo, Justin Donald
Published: (2018)

Counterfactual, prevention and causal thinking about workplace slip and trip accidents : a study of safety professionals, managers and accident subjects
by: Lehane, Paul Michael
Published: (2015)

Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes
by: Boominathan, Soorajnath, et al.
Published: (2021)

The trajectory of counterfactual simulation in development.
by: Kominsky, JF, et al.
Published: (2022)

Causal effect inference with deep latent-variable models
by: Louizos, Christos, et al.
Published: (2022)

Modeling downward counterfactual events : unrealized disasters and why they matter
by: Lin, Yolanda C., et al.
Published: (2020)

Seismic Hazard Assessment For Peninsular Malaysia Using Gumbel Distribution Method
by: Adnan, Azlan, et al.
Published: (2005)

Impact of dependence on parameter estimates of autoregressive process with Gumbel distributed innovation
by: Samuel, Bako Sunday, et al.
Published: (2018)

The skyline of counterfactual explanations for machine learning decision models
by: Wang, Yongjie, et al.
Published: (2022)

Counterfactual explanations for machine learning models on heterogeneous data
by: Wang, Yongjie
Published: (2023)

Nonparametric Counterfactual Predictions in Neoclassical Models of International Trade
by: Adao, Rodrigo, et al.
Published: (2018)

More data means less inference: A pseudo-max approach to structured learning
by: Sontag, David, et al.
Published: (2011)

Symmetricity between the sampling distribution of coefficient of variations, CVc and CVr for Gumbel samples.
by: Maarof, Fauziah, et al.

Trace-free counterfactual communication with a nanophotonic processor
by: Arvidsson Shukur, David Roland, et al.
Published: (2021)

Counterfactual explanations on the changes in foreign exchange market
by: Sng, Rhys Yi
Published: (2024)

What if This Modified That? Syntactic Interventions with Counterfactual Embeddings
by: Tucker, Mycal, et al.
Published: (2023)

Counterfactuals, dispositions, and conscious experience : essays on entropy
by: Elga, Adam Newman, 1974-
Published: (2009)

Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation
by: Vendrow, Joshua L.
Published: (2024)

Celebrating successful earthquake risk reduction through counterfactual probabilistic analysis
by: Rabonza, Maricar, et al.
Published: (2021)

Counterfactual Learning To Rank for Utility-Maximizing Query Autocompletion
by: Block, Adam, et al.
Published: (2022)

Identifying causal effects of public policies
by: Kling, Jeffrey R
Published: (2005)

Counterfactual samples synthesizing and training for robust visual question answering
by: Chen, Long, et al.
Published: (2023)

Towards an understanding of the relationship between counterfactual thinking and career aspirations.
by: Teo, Haw Yin.
Published: (2011)

Counterfactual explanations for forex prediction using deep learning methods
by: Vinod, Vinay Krishnaa
Published: (2024)

An Exact and Robust Conformal Inference Method for Counterfactual and Synthetic Controls
by: Chernozhukov, Victor, et al.
Published: (2022)