Bayesian persuasion in sequential decision-making

We study a dynamic model of Bayesian persuasion in sequential decision-making settings. An informed principal observes an external parameter of the world and advises an uninformed agent about actions to take over time. The agent takes actions in each time step based on the current state, the princip...

Full description

Bibliographic Details
Main Authors:	Gan, J, Majumdar, R, Radanovic, G, Singla, A
Format:	Conference item
Language:	English
Published:	AAAI Press 2022

_version_	1797107262328668160
author	Gan, J Majumdar, R Radanovic, G Singla, A
author_facet	Gan, J Majumdar, R Radanovic, G Singla, A
author_sort	Gan, J
collection	OXFORD
description	We study a dynamic model of Bayesian persuasion in sequential decision-making settings. An informed principal observes an external parameter of the world and advises an uninformed agent about actions to take over time. The agent takes actions in each time step based on the current state, the principal's advice/signal, and beliefs about the external parameter. The action of the agent updates the state according to a stochastic process. The model arises naturally in many applications, e.g., an app (the principal) can advice the user (the agent) on possible choices between actions based on additional real-time information the app has. We study the problem of designing a signaling strategy from the principal's point of view. We show that the principal has an optimal strategy against a myopic agent, who only optimizes their rewards locally, and the optimal strategy can be computed in polynomial time. In contrast, it is NP-hard to approximate an optimal policy against a far-sighted agent. Further, we show that if the principal has the power to threaten the agent by not providing future signals, then we can efficiently design a threat-based strategy. This strategy guarantees the principal's payoff as if playing against an agent who is far-sighted but myopic to future signals.
first_indexed	2024-03-07T07:13:35Z
format	Conference item
id	oxford-uuid:3d94202a-3518-4ccf-9846-e6985e5351f8
institution	University of Oxford
language	English
last_indexed	2024-03-07T07:13:35Z
publishDate	2022
publisher	AAAI Press
record_format	dspace
spelling	oxford-uuid:3d94202a-3518-4ccf-9846-e6985e5351f82022-07-22T09:57:27ZBayesian persuasion in sequential decision-makingConference itemhttp://purl.org/coar/resource_type/c_5794uuid:3d94202a-3518-4ccf-9846-e6985e5351f8EnglishSymplectic ElementsAAAI Press2022Gan, JMajumdar, RRadanovic, GSingla, AWe study a dynamic model of Bayesian persuasion in sequential decision-making settings. An informed principal observes an external parameter of the world and advises an uninformed agent about actions to take over time. The agent takes actions in each time step based on the current state, the principal's advice/signal, and beliefs about the external parameter. The action of the agent updates the state according to a stochastic process. The model arises naturally in many applications, e.g., an app (the principal) can advice the user (the agent) on possible choices between actions based on additional real-time information the app has. We study the problem of designing a signaling strategy from the principal's point of view. We show that the principal has an optimal strategy against a myopic agent, who only optimizes their rewards locally, and the optimal strategy can be computed in polynomial time. In contrast, it is NP-hard to approximate an optimal policy against a far-sighted agent. Further, we show that if the principal has the power to threaten the agent by not providing future signals, then we can efficiently design a threat-based strategy. This strategy guarantees the principal's payoff as if playing against an agent who is far-sighted but myopic to future signals.
spellingShingle	Gan, J Majumdar, R Radanovic, G Singla, A Bayesian persuasion in sequential decision-making
title	Bayesian persuasion in sequential decision-making
title_full	Bayesian persuasion in sequential decision-making
title_fullStr	Bayesian persuasion in sequential decision-making
title_full_unstemmed	Bayesian persuasion in sequential decision-making
title_short	Bayesian persuasion in sequential decision-making
title_sort	bayesian persuasion in sequential decision making
work_keys_str_mv	AT ganj bayesianpersuasioninsequentialdecisionmaking AT majumdarr bayesianpersuasioninsequentialdecisionmaking AT radanovicg bayesianpersuasioninsequentialdecisionmaking AT singlaa bayesianpersuasioninsequentialdecisionmaking

Bayesian persuasion in sequential decision-making

Similar Items