Fourier policy gradients

Fourier policy gradients

We propose a new way of deriving policy gradient updates for reinforcement learning. Our technique, based on Fourier analysis, recasts integrals that arise with expected policy gradients as convolutions and turns them into multiplications. The obtained analytical solutions allow us to capture the lo...

Bibliografske podrobnosti
Main Authors:	Fellows, M, Ciosek, K, Whiteson, S
Format:	Conference item
Izdano:	Journal of Machine Learning Research 2018

Podobne knjige/članki

Expected policy gradients
od: Ciosek, K, et al.
Izdano: (2018)

Expected policy gradients for reinforcement learning
od: Ciosek, K, et al.
Izdano: (2020)

OFFER: Off-environment reinforcement learning
od: Ciosek, K, et al.
Izdano: (2017)

Counterfactual multi−agent policy gradients
od: Foerster, J, et al.
Izdano: (2018)

Fast efficient hyperparameter tuning for policy gradient methods
od: Paul, S, et al.
Izdano: (2019)

Gradients of connectivity as graph Fourier bases of brain activity
od: Giulia Lioi, et al.
Izdano: (2021-01-01)

FACMAC: Factored multi−agent centralised policy gradients
od: Peng, B, et al.
Izdano: (2022)

THE IMAGE REGISTRATION OF FOURIER-MELLIN BASED ON THE COMBINATION OF PROJECTION AND GRADIENT PREPROCESSING
od: D. Gao, et al.
Izdano: (2017-09-01)

REPRESENTATION OF GRADIENTS OF A SCALAR FIELD ON THE SPHERE USING A 2D FOURIER EXPRESSION
od: M. A. Sharifi, et al.
Izdano: (2015-12-01)

Bayesian Bellman operators
od: Fellows, M, et al.
Izdano: (2022)

Alternating optimisation and quadrature for robust control
od: Paul, S, et al.
Izdano: (2018)

Extreme diffraction management in phase-corrected gradient metasurface by fourier harmonic component engineering
od: Wang, Yuxiang, et al.
Izdano: (2023)

Application of the Fourier Series Expansion Method for the Inversion of Gravity Gradients using Gravity Anomalies
od: Bei Liu, et al.
Izdano: (2022-12-01)

Gradients in the mammalian cerebellar cortex enable Fourier-like transformation and improve storing capacity
od: Isabelle Straub, et al.
Izdano: (2020-02-01)

Iris Segmentation using Gradient Magnitude and Fourier Descriptor for Multimodal Biometric Authentication System
od: Defiana Sulaeman, et al.
Izdano: (2016-10-01)

Robust reinforcement learning with Bayesian optimisation and quadrature
od: Paul, S, et al.
Izdano: (2020)

GradientDICE: rethinking generalized offline estimation of stationary values
od: Zhang, S, et al.
Izdano: (2020)

VIREL: A variational inference framework for reinforcement learning
od: Fellows, M, et al.
Izdano: (2019)

On Quantum Natural Policy Gradients
od: Andre Sequeira, et al.
Izdano: (2024-01-01)

Multileave gradient descent for fast online learning to rank
od: Whiteson, S, et al.
Izdano: (2016)

Fourier series /
od: 354160 Ritt, Robert K.
Izdano: (1970)

Fourier ellipsometry – an ellipsometric approach to Fourier scatterometry
od: Petrik P., et al.
Izdano: (2015-01-01)

Fourier transform /
od: 393526 Bochner, Salomon, et al.
Izdano: (1949)

Trainability issues in quantum policy gradients
od: André Sequeira, et al.
Izdano: (2024-01-01)

Fourier transform, fourier sine and cosine transforms /
od: Nurul 'Aqilah Mohd Hashim, et al.
Izdano: (2011)

FOURIER2D and FOURIER3D : programs to demonstrate Fourier synthesis in crystallography
od: Glazer, A
Izdano: (2016)

Energy and Environmental Policy Trends: Indirect Carbon Tax Costs Reduced by Policy Design
od: G. Kent Fellows, et al.
Izdano: (2023-06-01)

Laplace and fourier transforms
od: 391553 Goyal, J. K., et al.

Applied Fourier transform /
od: Morita, K
Izdano: (1995)

Fourier BEM : generalization of boundary element methods by Fourier transform /
od: Duddeck, Fabian M.E., 1965-
Izdano: (2002)

Thermal characteristics of longitudinal fin with Fourier and non-Fourier heat transfer by Fourier sine transforms
od: Basma Souayeh, et al.
Izdano: (2021-12-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
od: G. Kent Fellows
Izdano: (2018-03-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
od: G. Kent Fellows
Izdano: (2018-03-01)

Energy and Environmental Policy Trends: The Invisible Cost of Pipeline Constraints
od: G. Kent Fellows
Izdano: (2018-03-01)

Comparison between fourier and corrected fourier series methods
od: Zainal, Nor Hafizah, et al.
Izdano: (2013)

Fourier ptychography algorithm based on scaled Fourier transform
od: Mojde Hasanzade, et al.
Izdano: (2021-02-01)

Policy gradient methods for linear quadratic problems
od: Yang, H
Izdano: (2022)

Enhanced deep deterministic policy gradient algorithm
od: Jianping CHEN, et al.
Izdano: (2018-11-01)

Enhanced deep deterministic policy gradient algorithm
od: Jianping CHEN, et al.
Izdano: (2018-11-01)

Policy gradient rules for populations of spiking neurons
od: Urbanczik Robert, et al.
Izdano: (2011-07-01)