Fourier policy gradients

Fourier policy gradients

We propose a new way of deriving policy gradient updates for reinforcement learning. Our technique, based on Fourier analysis, recasts integrals that arise with expected policy gradients as convolutions and turns them into multiplications. The obtained analytical solutions allow us to capture the lo...

Full description

Bibliographic Details
Main Authors:	Fellows, M, Ciosek, K, Whiteson, S
Format:	Conference item
Published:	Journal of Machine Learning Research 2018

Similar Items

Expected policy gradients
by: Ciosek, K, et al.
Published: (2018)

Expected policy gradients for reinforcement learning
by: Ciosek, K, et al.
Published: (2020)

OFFER: Off-environment reinforcement learning
by: Ciosek, K, et al.
Published: (2017)

Counterfactual multi−agent policy gradients
by: Foerster, J, et al.
Published: (2018)

Fast efficient hyperparameter tuning for policy gradient methods
by: Paul, S, et al.
Published: (2019)

Gradients of connectivity as graph Fourier bases of brain activity
by: Giulia Lioi, et al.
Published: (2021-01-01)

FACMAC: Factored multi−agent centralised policy gradients
by: Peng, B, et al.
Published: (2022)

THE IMAGE REGISTRATION OF FOURIER-MELLIN BASED ON THE COMBINATION OF PROJECTION AND GRADIENT PREPROCESSING
by: D. Gao, et al.
Published: (2017-09-01)

REPRESENTATION OF GRADIENTS OF A SCALAR FIELD ON THE SPHERE USING A 2D FOURIER EXPRESSION
by: M. A. Sharifi, et al.
Published: (2015-12-01)

Bayesian Bellman operators
by: Fellows, M, et al.
Published: (2022)

Alternating optimisation and quadrature for robust control
by: Paul, S, et al.
Published: (2018)

Extreme diffraction management in phase-corrected gradient metasurface by fourier harmonic component engineering
by: Wang, Yuxiang, et al.
Published: (2023)

Application of the Fourier Series Expansion Method for the Inversion of Gravity Gradients using Gravity Anomalies
by: Bei Liu, et al.
Published: (2022-12-01)

Gradients in the mammalian cerebellar cortex enable Fourier-like transformation and improve storing capacity
by: Isabelle Straub, et al.
Published: (2020-02-01)

Iris Segmentation using Gradient Magnitude and Fourier Descriptor for Multimodal Biometric Authentication System
by: Defiana Sulaeman, et al.
Published: (2016-10-01)

Robust reinforcement learning with Bayesian optimisation and quadrature
by: Paul, S, et al.
Published: (2020)

GradientDICE: rethinking generalized offline estimation of stationary values
by: Zhang, S, et al.
Published: (2020)

VIREL: A variational inference framework for reinforcement learning
by: Fellows, M, et al.
Published: (2019)

On Quantum Natural Policy Gradients
by: Andre Sequeira, et al.
Published: (2024-01-01)

Multileave gradient descent for fast online learning to rank
by: Whiteson, S, et al.
Published: (2016)

Fourier series /
by: 354160 Ritt, Robert K.
Published: (1970)

Fourier ellipsometry – an ellipsometric approach to Fourier scatterometry
by: Petrik P., et al.
Published: (2015-01-01)

Fourier transform /
by: 393526 Bochner, Salomon, et al.
Published: (1949)

Trainability issues in quantum policy gradients
by: André Sequeira, et al.
Published: (2024-01-01)

Fourier transform, fourier sine and cosine transforms /
by: Nurul 'Aqilah Mohd Hashim, et al.
Published: (2011)

FOURIER2D and FOURIER3D : programs to demonstrate Fourier synthesis in crystallography
by: Glazer, A
Published: (2016)

Energy and Environmental Policy Trends: Indirect Carbon Tax Costs Reduced by Policy Design
by: G. Kent Fellows, et al.
Published: (2023-06-01)

Laplace and fourier transforms
by: 391553 Goyal, J. K., et al.

Applied Fourier transform /
by: Morita, K
Published: (1995)

Fourier BEM : generalization of boundary element methods by Fourier transform /
by: Duddeck, Fabian M.E., 1965-
Published: (2002)

Thermal characteristics of longitudinal fin with Fourier and non-Fourier heat transfer by Fourier sine transforms
by: Basma Souayeh, et al.
Published: (2021-12-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
by: G. Kent Fellows
Published: (2018-03-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
by: G. Kent Fellows
Published: (2018-03-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
by: G. Kent Fellows
Published: (2018-03-01)

Comparison between fourier and corrected fourier series methods
by: Zainal, Nor Hafizah, et al.
Published: (2013)

Fourier ptychography algorithm based on scaled Fourier transform
by: Mojde Hasanzade, et al.
Published: (2021-02-01)

Policy gradient methods for linear quadratic problems
by: Yang, H
Published: (2022)

Enhanced deep deterministic policy gradient algorithm
by: Jianping CHEN, et al.
Published: (2018-11-01)

Enhanced deep deterministic policy gradient algorithm
by: Jianping CHEN, et al.
Published: (2018-11-01)

Policy gradient rules for populations of spiking neurons
by: Urbanczik Robert, et al.
Published: (2011-07-01)