A learning-based synthesis approach of reward asynchronous probabilistic games against the linear temporal logic winning condition
The traditional synthesis problem is usually solved by constructing a system that fulfills given specifications. The system is constantly interacting with the environment and is opposed to the environment. The problem can be further regarded as solving a two-player game (the system and its environme...
Main Authors: | Wei Zhao, Zhiming Liu |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2022-09-01
|
Series: | PeerJ Computer Science |
Subjects: | |
Online Access: | https://peerj.com/articles/cs-1094.pdf |
Similar Items
-
Neural function underlying reward expectancy and attainment in adolescents with diverse psychiatric symptoms
by: Qi Liu, et al.
Published: (2022-01-01) -
Investigation and Identification Components of Compensation in Police Force based on Total Reward Model
by: Monire Alipour Madarsara, et al.
Published: (2019-11-01) -
Learning Potential in Subgoal-Based Reward Shaping
by: Takato Okudo, et al.
Published: (2023-01-01) -
GR(1)-Guided Deep Reinforcement Learning for Multi-Task Motion Planning under a Stochastic Environment
by: Chenyang Zhu, et al.
Published: (2022-11-01) -
Cyberspace attack and defense game based on reward randomization reinforcement learning
by: Lei Zhang, et al.
Published: (2022-12-01)