Markov Decision Processes with Multiple Long-run Average Objectives
We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We consider two different objectives, namely, expectation and satisfaction objectives. Given an MDP with k limit-average functions, in the expectation objective the goal is to maximize the expected limi...
Asıl Yazarlar: | , , , , |
---|---|
Materyal Türü: | Conference item |
Baskı/Yayın Bilgisi: |
IEEE
2011
|