Online hierarchical reinforcement learning based on interrupting Option

To cope with the volume of big data, an online updating algorithm named Macro-Q with in-place updating (MQIU) was proposed. It is based on the Macro-Q algorithm and takes advantage of the in-place updating approach. The MQIU algorithm updates both the value function of abstract action and the val...

Bibliographic details
Main authors:	Fei ZHU, Zhi-peng XU, Quan LIU, Yu-chen FU, Hui WANG
Format:	Article
Language:	Chinese (zho)
Published:	Editorial Department of Journal on Communications 2016-06-01
Series:	Tongxin xuebao
Subjects:	big data; reinforcement learning; hierarchical reinforcement learning; Option; online learning
Online access:	http://www.joconline.com.cn/thesisDetails#10.11959/j.issn.1000-436x.2016117
