Fourier policy gradients

Fourier policy gradients

We propose a new way of deriving policy gradient updates for reinforcement learning. Our technique, based on Fourier analysis, recasts integrals that arise with expected policy gradients as convolutions and turns them into multiplications. The obtained analytical solutions allow us to capture the lo...

Mô tả đầy đủ

Chi tiết về thư mục
Những tác giả chính:	Fellows, M, Ciosek, K, Whiteson, S
Định dạng:	Conference item
Được phát hành:	Journal of Machine Learning Research 2018

Những quyển sách tương tự

Expected policy gradients
Bằng: Ciosek, K, et al.
Được phát hành: (2018)

Expected policy gradients for reinforcement learning
Bằng: Ciosek, K, et al.
Được phát hành: (2020)

OFFER: Off-environment reinforcement learning
Bằng: Ciosek, K, et al.
Được phát hành: (2017)

Counterfactual multi−agent policy gradients
Bằng: Foerster, J, et al.
Được phát hành: (2018)

Fast efficient hyperparameter tuning for policy gradient methods
Bằng: Paul, S, et al.
Được phát hành: (2019)

Gradients of connectivity as graph Fourier bases of brain activity
Bằng: Giulia Lioi, et al.
Được phát hành: (2021-01-01)

FACMAC: Factored multi−agent centralised policy gradients
Bằng: Peng, B, et al.
Được phát hành: (2022)

THE IMAGE REGISTRATION OF FOURIER-MELLIN BASED ON THE COMBINATION OF PROJECTION AND GRADIENT PREPROCESSING
Bằng: D. Gao, et al.
Được phát hành: (2017-09-01)

REPRESENTATION OF GRADIENTS OF A SCALAR FIELD ON THE SPHERE USING A 2D FOURIER EXPRESSION
Bằng: M. A. Sharifi, et al.
Được phát hành: (2015-12-01)

Bayesian Bellman operators
Bằng: Fellows, M, et al.
Được phát hành: (2022)

Alternating optimisation and quadrature for robust control
Bằng: Paul, S, et al.
Được phát hành: (2018)

Extreme diffraction management in phase-corrected gradient metasurface by fourier harmonic component engineering
Bằng: Wang, Yuxiang, et al.
Được phát hành: (2023)

Application of the Fourier Series Expansion Method for the Inversion of Gravity Gradients using Gravity Anomalies
Bằng: Bei Liu, et al.
Được phát hành: (2022-12-01)

Gradients in the mammalian cerebellar cortex enable Fourier-like transformation and improve storing capacity
Bằng: Isabelle Straub, et al.
Được phát hành: (2020-02-01)

Iris Segmentation using Gradient Magnitude and Fourier Descriptor for Multimodal Biometric Authentication System
Bằng: Defiana Sulaeman, et al.
Được phát hành: (2016-10-01)

Robust reinforcement learning with Bayesian optimisation and quadrature
Bằng: Paul, S, et al.
Được phát hành: (2020)

GradientDICE: rethinking generalized offline estimation of stationary values
Bằng: Zhang, S, et al.
Được phát hành: (2020)

VIREL: A variational inference framework for reinforcement learning
Bằng: Fellows, M, et al.
Được phát hành: (2019)

On Quantum Natural Policy Gradients
Bằng: Andre Sequeira, et al.
Được phát hành: (2024-01-01)

Multileave gradient descent for fast online learning to rank
Bằng: Whiteson, S, et al.
Được phát hành: (2016)

Fourier series /
Bằng: 354160 Ritt, Robert K.
Được phát hành: (1970)

Fourier ellipsometry – an ellipsometric approach to Fourier scatterometry
Bằng: Petrik P., et al.
Được phát hành: (2015-01-01)

Fourier transform /
Bằng: 393526 Bochner, Salomon, et al.
Được phát hành: (1949)

Trainability issues in quantum policy gradients
Bằng: André Sequeira, et al.
Được phát hành: (2024-01-01)

Fourier transform, fourier sine and cosine transforms /
Bằng: Nurul 'Aqilah Mohd Hashim, et al.
Được phát hành: (2011)

FOURIER2D and FOURIER3D : programs to demonstrate Fourier synthesis in crystallography
Bằng: Glazer, A
Được phát hành: (2016)

Energy and Environmental Policy Trends: Indirect Carbon Tax Costs Reduced by Policy Design
Bằng: G. Kent Fellows, et al.
Được phát hành: (2023-06-01)

Laplace and fourier transforms
Bằng: 391553 Goyal, J. K., et al.

Applied Fourier transform /
Bằng: Morita, K
Được phát hành: (1995)

Fourier BEM : generalization of boundary element methods by Fourier transform /
Bằng: Duddeck, Fabian M.E., 1965-
Được phát hành: (2002)

Thermal characteristics of longitudinal fin with Fourier and non-Fourier heat transfer by Fourier sine transforms
Bằng: Basma Souayeh, et al.
Được phát hành: (2021-12-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
Bằng: G. Kent Fellows
Được phát hành: (2018-03-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
Bằng: G. Kent Fellows
Được phát hành: (2018-03-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
Bằng: G. Kent Fellows
Được phát hành: (2018-03-01)

Comparison between fourier and corrected fourier series methods
Bằng: Zainal, Nor Hafizah, et al.
Được phát hành: (2013)

Fourier ptychography algorithm based on scaled Fourier transform
Bằng: Mojde Hasanzade, et al.
Được phát hành: (2021-02-01)

Policy gradient methods for linear quadratic problems
Bằng: Yang, H
Được phát hành: (2022)

Enhanced deep deterministic policy gradient algorithm
Bằng: Jianping CHEN, et al.
Được phát hành: (2018-11-01)

Enhanced deep deterministic policy gradient algorithm
Bằng: Jianping CHEN, et al.
Được phát hành: (2018-11-01)

Policy gradient rules for populations of spiking neurons
Bằng: Urbanczik Robert, et al.
Được phát hành: (2011-07-01)