OFFER: Off-environment reinforcement learning

OFFER: Off-environment reinforcement learning

Policy gradient methods have been widely applied in reinforcement learning. For reasons of safety and cost, learning is often conducted using a simulator. However, learning in simulation does not traditionally utilise the opportunity to improve learning by adjusting certain environment variables - s...

Descrición completa

Detalles Bibliográficos
Main Authors:	Ciosek, K, Whiteson, S
Formato:	Conference item
Idioma:	English
Publicado:	AAAI Press 2017

Títulos similares

Expected policy gradients for reinforcement learning
por: Ciosek, K, et al.
Publicado: (2020)

Robust reinforcement learning with Bayesian optimisation and quadrature
por: Paul, S, et al.
Publicado: (2020)

Expected policy gradients
por: Ciosek, K, et al.
Publicado: (2018)

Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning
por: Farquhar, G, et al.
Publicado: (2019)

Fourier policy gradients
por: Fellows, M, et al.
Publicado: (2018)

Off-Dynamics Inverse Reinforcement Learning
por: Yachen Kang, et al.
Publicado: (2024-01-01)

Inverse reinforcement learning from failure
por: Shiarlis, K, et al.
Publicado: (2016)

Deep residual reinforcement learning
por: Zhang, S, et al.
Publicado: (2020)

Reinforcement Learning system to capture value from Brazilian post-harvest offers
por: Fernando Henrique Lermen, et al.
Publicado: (2024-12-01)

Learning retrospective knowledge with reverse reinforcement learning
por: Zhang, S, et al.
Publicado: (2020)

Off-policy reinforcement learning with Gaussian processes
por: Chowdhary, Girish, et al.
Publicado: (2015)

Bayesian action decoder for deep multi-agent reinforcement learning
por: Whiteson, S
Publicado: (2019)

Fingerprint policy optimisation for robust reinforcement learning
por: Paul, S, et al.
Publicado: (2019)

Learning to communicate with Deep multi-agent reinforcement learning
por: Foerster, J, et al.
Publicado: (2016)

Deep variational reinforcement learning for POMDPs
por: Igl, M, et al.
Publicado: (2018)

Mean−variance policy iteration for risk−averse reinforcement learning
por: Zhang, S, et al.
Publicado: (2021)

Enhanced Off-Policy Reinforcement Learning With Focused Experience Replay
por: Seung-Hyun Kong, et al.
Publicado: (2021-01-01)

A UNIQUE COLLABORATION: USM FIRST-TIME OFFERING OFF-SHORE ARCHITECTURAL PROGRAMME
por: MPRC, Pusat Media & Perhubungan Awam
Publicado: (2015)

VIREL: A variational inference framework for reinforcement learning
por: Fellows, M, et al.
Publicado: (2019)

Alternating optimisation and quadrature for robust control
por: Paul, S, et al.
Publicado: (2018)

Generalized Off-Policy Actor-Critic
por: Zhang, S, et al.
Publicado: (2019)

The potential of offering HIV-related services in an optometry environment
por: Haseena Majid, et al.
Publicado: (2020-02-01)

What can catalysts offer for environment pollution control ? /
por: 604334 Wan Azelee Wan Abu Bakar, et al.
Publicado: (2001)

Z-Score Experience Replay in Off-Policy Deep Reinforcement Learning
por: Yana Yang, et al.
Publicado: (2024-12-01)

Off-Policy Meta-Reinforcement Learning With Belief-Based Task Inference
por: Takahisa Imagawa, et al.
Publicado: (2022-01-01)

Transient dynamics in trial-offer markets with social influence: Trade-offs between appeal and quality.
por: Edgar Altszyler, et al.
Publicado: (2017-01-01)

Deep Reinforcement Learning in complex environments
por: Nardelli, N
Publicado: (2021)

Reactive Reinforcement Learning in Asynchronous Environments
por: Jaden B. Travnik, et al.
Publicado: (2018-06-01)

Spirituality in offering a peace offering
por: Nobuyoshi Kiuchi
Publicado: (1999-05-01)

Transient non−stationarity and generalisation in deep reinforcement learning
por: Igl, M, et al.
Publicado: (2021)

Reinforcement learning based mainline dynamic speed limit adjustment of expressway off‐ramp upstream under connected and autonomous vehicles environment
por: Daiquan Xiao, et al.
Publicado: (2022-12-01)

Designing the Inclusive Built Environment: An Exploration of Opportunities Offered by ICTs
por: Emilia Conte
Publicado: (2017-07-01)

Exploration in approximate hyper-state space for meta reinforcement learning
por: Zintgraf, L, et al.
Publicado: (2021)

Reliability assessment of off-policy deep reinforcement learning: A benchmark for aerodynamics
por: Sandrine Berger, et al.
Publicado: (2024-01-01)

Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning
por: Ning Chen, et al.
Publicado: (2020-01-01)

Multi-agent common knowledge reinforcement learning
por: de Witt, C, et al.
Publicado: (2019)

TreeQN and ATreeC: differentiable tree planning for deep reinforcement learning
por: Farquhar, G, et al.
Publicado: (2018)

Reinforcement learning enhanced quantum-inspired algorithm for combinatorial optimization
por: Beloborodov, D, et al.
Publicado: (2020)

SCIENCE@ALORSETAR TO OFFER BLENDED LEARNING TO STUDENTS
por: MPRC, Pusat Media & Perhubungan Awam
Publicado: (2016)

Pioneering New Ways of Offering Learning Assistance
por: Deborah Parra
Publicado: (1997-10-01)