A learning-based synthesis approach of reward asynchronous probabilistic games against the linear temporal logic winning condition
The traditional synthesis problem is usually solved by constructing a system that fulfills given specifications. The system is constantly interacting with the environment and is opposed to the environment. The problem can be further regarded as solving a two-player game (the system and its environme...
Main Authors: | Wei Zhao, Zhiming Liu |
---|---|
Format: | Article |
Language: | English |
Published: | PeerJ Inc., 2022-09-01 |
Series: | PeerJ Computer Science |
Subjects: | |
Online Access: | https://peerj.com/articles/cs-1094.pdf |
Similar Items
- Noise-Correlation Is Modulated by Reward Expectation in the Primary Motor Cortex Bilaterally During Manual and Observational Tasks in Primates
  by: Brittany Moore, et al.
  Published: (2020-12-01)
- THE INFLUENCE OF REWARDS ON EMPLOYEE PERFORMANCE WITH REWARDS SEPARATION AS MODERATING VARIABLE
  by: Khoirul Khuluq, et al.
  Published: (2019-01-01)
- Neural function underlying reward expectancy and attainment in adolescents with diverse psychiatric symptoms
  by: Qi Liu, et al.
  Published: (2022-01-01)
- GR(1)-Guided Deep Reinforcement Learning for Multi-Task Motion Planning under a Stochastic Environment
  by: Chenyang Zhu, et al.
  Published: (2022-11-01)
- Learning Potential in Subgoal-Based Reward Shaping
  by: Takato Okudo, et al.
  Published: (2023-01-01)