Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition

Bibliographic Details
Main Authors: Jin, Chi, Jin, Tiancheng, Luo, Haipeng, Sra, Suvrit, Yu, Tiancheng
Other Authors: Massachusetts Institute of Technology. Institute for Data, Systems, and Society
Format: Article
Language:English
Published: 2022
Online Access:https://hdl.handle.net/1721.1/143895
_version_ 1811072917068316672
author Jin, Chi
Jin, Tiancheng
Luo, Haipeng
Sra, Suvrit
Yu, Tiancheng
author2 Massachusetts Institute of Technology. Institute for Data, Systems, and Society
author_facet Massachusetts Institute of Technology. Institute for Data, Systems, and Society
Jin, Chi
Jin, Tiancheng
Luo, Haipeng
Sra, Suvrit
Yu, Tiancheng
author_sort Jin, Chi
collection MIT
first_indexed 2024-09-23T09:21:32Z
format Article
id mit-1721.1/143895
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T09:21:32Z
publishDate 2022
record_format dspace
spelling mit-1721.1/1438952023-02-01T16:56:36Z Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition Jin, Chi Jin, Tiancheng Luo, Haipeng Sra, Suvrit Yu, Tiancheng Massachusetts Institute of Technology. Institute for Data, Systems, and Society Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science 2022-07-20T16:41:40Z 2022-07-20T16:41:40Z 2020 2022-07-20T16:37:52Z Article http://purl.org/eprint/type/ConferencePaper https://hdl.handle.net/1721.1/143895 Jin, Chi, Jin, Tiancheng, Luo, Haipeng, Sra, Suvrit and Yu, Tiancheng. 2020. "Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition." INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 119. en https://proceedings.mlr.press/v119/jin20c.html INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119 Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. application/pdf Proceedings of Machine Learning Research
spellingShingle Jin, Chi
Jin, Tiancheng
Luo, Haipeng
Sra, Suvrit
Yu, Tiancheng
Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
title Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
title_full Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
title_fullStr Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
title_full_unstemmed Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
title_short Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
title_sort learning adversarial markov decision processes with bandit feedback and unknown transition
url https://hdl.handle.net/1721.1/143895
work_keys_str_mv AT jinchi learningadversarialmarkovdecisionprocesseswithbanditfeedbackandunknowntransition
AT jintiancheng learningadversarialmarkovdecisionprocesseswithbanditfeedbackandunknowntransition
AT luohaipeng learningadversarialmarkovdecisionprocesseswithbanditfeedbackandunknowntransition
AT srasuvrit learningadversarialmarkovdecisionprocesseswithbanditfeedbackandunknowntransition
AT yutiancheng learningadversarialmarkovdecisionprocesseswithbanditfeedbackandunknowntransition