Markov decision processes with observation costs: framework and computation with a penalty scheme

We consider Markov decision processes where the state of the chain is only given at chosen observation times and of a cost. Optimal strategies involve the optimisation of observation times as well as the subsequent action values. We consider the finite horizon and discounted infinite horizon problem...

সম্পূর্ণ বিবরণ

গ্রন্থ-পঞ্জীর বিবরন
প্রধান লেখক:	Reisinger, C, Tam, J
বিন্যাস:	Journal article
ভাষা:	English
প্রকাশিত:	INFORMS 2024

Markov decision processes with observation costs: framework and computation with a penalty scheme

অনুরূপ উপাদানগুলি