Approximate Dynamic Programming Using Bellman Residual Elimination and Gaussian Process Regression
This paper presents an approximate policy iteration algorithm for solving infinite-horizon, discounted Markov decision processes (MDPs) for which a model of the system is available. The algorithm is similar in spirit to Bellman residual minimization methods. However, by using Gaussian process regression…
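The abstract only sketches the approach at a high level. As a rough illustration of the general idea it points to (a kernel/GP-style value-function regressor used inside approximate policy iteration, fit so that the Bellman residual vanishes at a set of sampled states), the snippet below runs on a toy random MDP with a known model. The RBF kernel, the toy MDP, and all names in the code are illustrative assumptions, not the paper's actual algorithm.

```python
# Minimal sketch (not the paper's implementation): kernel-based policy
# evaluation that drives the Bellman residual to zero at sampled states,
# wrapped in a policy iteration loop on a toy random MDP.
import numpy as np

rng = np.random.default_rng(0)
n_states, n_actions, gamma = 20, 3, 0.95

# Known model (illustrative): transitions P[a, s, s'] and rewards R[s, a].
P = rng.dirichlet(np.ones(n_states), size=(n_actions, n_states))
R = rng.normal(size=(n_states, n_actions))
all_states = np.arange(n_states)

def rbf_kernel(x, y, length_scale=3.0):
    """Squared-exponential kernel on scalar state indices."""
    d = np.asarray(x, float)[:, None] - np.asarray(y, float)[None, :]
    return np.exp(-0.5 * (d / length_scale) ** 2)

def evaluate_policy(policy, sample_states, jitter=1e-6):
    """Fit V(s) = sum_j alpha_j k(s, s_j) so that, at each sampled state,
    V(s) = R(s, pi(s)) + gamma * E[V(s') | s, pi(s)]  (zero Bellman residual)."""
    acts = policy[sample_states]
    K = rbf_kernel(sample_states, sample_states) + jitter * np.eye(len(sample_states))
    # Expected kernel features of the next state under the fixed policy.
    K_next = P[acts, sample_states] @ rbf_kernel(all_states, sample_states)
    alpha = np.linalg.solve(K - gamma * K_next, R[sample_states, acts])
    return lambda s: rbf_kernel(s, sample_states) @ alpha

# Approximate policy iteration: evaluate, then act greedily w.r.t. the model.
policy = np.zeros(n_states, dtype=int)
sample_states = all_states  # in general, a subset of representative states
for _ in range(50):
    v = evaluate_policy(policy, sample_states)(all_states)
    new_policy = (R + gamma * (P @ v).T).argmax(axis=1)
    if np.array_equal(new_policy, policy):
        break
    policy = new_policy

print("Greedy policy:", policy)
```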
| Main Author | |
| --- | --- |
| Other Authors | |
| Format | Article |
| Language | en_US |
| Published | Institute of Electrical and Electronics Engineers, 2010 |
| Online Access | http://hdl.handle.net/1721.1/58907, https://orcid.org/0000-0001-8576-1930 |