Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
Main Authors: | Jin, Chi; Jin, Tiancheng; Luo, Haipeng; Sra, Suvrit; Yu, Tiancheng |
---|---|
Other Authors: | Massachusetts Institute of Technology. Institute for Data, Systems, and Society |
Format: | Article |
Language: | English |
Published: | 2022 |
Online Access: | https://hdl.handle.net/1721.1/143895 |
_version_ | 1811072917068316672 |
---|---|
author | Jin, Chi; Jin, Tiancheng; Luo, Haipeng; Sra, Suvrit; Yu, Tiancheng |
author2 | Massachusetts Institute of Technology. Institute for Data, Systems, and Society |
author_facet | Massachusetts Institute of Technology. Institute for Data, Systems, and Society; Jin, Chi; Jin, Tiancheng; Luo, Haipeng; Sra, Suvrit; Yu, Tiancheng |
author_sort | Jin, Chi |
collection | MIT |
first_indexed | 2024-09-23T09:21:32Z |
format | Article |
id | mit-1721.1/143895 |
institution | Massachusetts Institute of Technology |
language | English |
last_indexed | 2024-09-23T09:21:32Z |
publishDate | 2022 |
record_format | dspace |
spelling | mit-1721.1/143895; 2023-02-01T16:56:36Z; Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition; Jin, Chi; Jin, Tiancheng; Luo, Haipeng; Sra, Suvrit; Yu, Tiancheng; Massachusetts Institute of Technology. Institute for Data, Systems, and Society; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science; 2022-07-20T16:41:40Z; 2022-07-20T16:41:40Z; 2020; 2022-07-20T16:37:52Z; Article; http://purl.org/eprint/type/ConferencePaper; https://hdl.handle.net/1721.1/143895; Jin, Chi, Jin, Tiancheng, Luo, Haipeng, Sra, Suvrit and Yu, Tiancheng. 2020. "Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition." INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 119; en; https://proceedings.mlr.press/v119/jin20c.html; INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119; Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.; application/pdf; Proceedings of Machine Learning Research |
title | Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition |
url | https://hdl.handle.net/1721.1/143895 |