Proximal algorithms and temporal difference methods for solving fixed point problems

Abstract In this paper we consider large fixed point problems and solution with proximal algorithms. We show that for linear problems there is a close connection between proximal iterations, which are prominent in numerical analysis and optimization, and multistep methods of the tempo...

Full description

Bibliographic Details
Main Author:	Bertsekas, Dimitri P
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format:	Article
Language:	English
Published:	Springer US 2021
Online Access:	https://hdl.handle.net/1721.1/131865

_version_	1811069675640979456
author	Bertsekas, Dimitri P
author2	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
author_facet	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Bertsekas, Dimitri P
author_sort	Bertsekas, Dimitri P
collection	MIT
description	Abstract In this paper we consider large fixed point problems and solution with proximal algorithms. We show that for linear problems there is a close connection between proximal iterations, which are prominent in numerical analysis and optimization, and multistep methods of the temporal difference type such as TD( $$\lambda $$ λ ), LSTD( $$\lambda $$ λ ), and LSPE( $$\lambda $$ λ ), which are central in simulation-based exact and approximate dynamic programming. One benefit of this connection is a new and simple way to accelerate the standard proximal algorithm by extrapolation towards a multistep iteration, which generically has a faster convergence rate. Another benefit is the potential for integration into the proximal algorithmic context of several new ideas that have emerged in the approximate dynamic programming context, including simulation-based implementations. Conversely, the analytical and algorithmic insights from proximal algorithms can be brought to bear on the analysis and the enhancement of temporal difference methods. We also generalize our linear case result to nonlinear problems that involve a contractive mapping, thus providing guaranteed and potentially substantial acceleration of the proximal and forward backward splitting algorithms at no extra cost. Moreover, under certain monotonicity assumptions, we extend the connection with temporal difference methods to nonlinear problems through a linearization approach.
first_indexed	2024-09-23T08:14:10Z
format	Article
id	mit-1721.1/131865
institution	Massachusetts Institute of Technology
language	English
last_indexed	2024-09-23T08:14:10Z
publishDate	2021
publisher	Springer US
record_format	dspace
spelling	mit-1721.1/1318652023-12-22T19:11:46Z Proximal algorithms and temporal difference methods for solving fixed point problems Bertsekas, Dimitri P Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology. Laboratory for Information and Decision Systems Abstract In this paper we consider large fixed point problems and solution with proximal algorithms. We show that for linear problems there is a close connection between proximal iterations, which are prominent in numerical analysis and optimization, and multistep methods of the temporal difference type such as TD( $$\lambda $$ λ ), LSTD( $$\lambda $$ λ ), and LSPE( $$\lambda $$ λ ), which are central in simulation-based exact and approximate dynamic programming. One benefit of this connection is a new and simple way to accelerate the standard proximal algorithm by extrapolation towards a multistep iteration, which generically has a faster convergence rate. Another benefit is the potential for integration into the proximal algorithmic context of several new ideas that have emerged in the approximate dynamic programming context, including simulation-based implementations. Conversely, the analytical and algorithmic insights from proximal algorithms can be brought to bear on the analysis and the enhancement of temporal difference methods. We also generalize our linear case result to nonlinear problems that involve a contractive mapping, thus providing guaranteed and potentially substantial acceleration of the proximal and forward backward splitting algorithms at no extra cost. Moreover, under certain monotonicity assumptions, we extend the connection with temporal difference methods to nonlinear problems through a linearization approach. 2021-09-20T17:30:42Z 2021-09-20T17:30:42Z 2018-03-02 2020-09-24T21:34:30Z Article http://purl.org/eprint/type/JournalArticle https://hdl.handle.net/1721.1/131865 en https://doi.org/10.1007/s10589-018-9990-5 Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ Springer Science+Business Media, LLC, part of Springer Nature application/pdf Springer US Springer US
spellingShingle	Bertsekas, Dimitri P Proximal algorithms and temporal difference methods for solving fixed point problems
title	Proximal algorithms and temporal difference methods for solving fixed point problems
title_full	Proximal algorithms and temporal difference methods for solving fixed point problems
title_fullStr	Proximal algorithms and temporal difference methods for solving fixed point problems
title_full_unstemmed	Proximal algorithms and temporal difference methods for solving fixed point problems
title_short	Proximal algorithms and temporal difference methods for solving fixed point problems
title_sort	proximal algorithms and temporal difference methods for solving fixed point problems
url	https://hdl.handle.net/1721.1/131865
work_keys_str_mv	AT bertsekasdimitrip proximalalgorithmsandtemporaldifferencemethodsforsolvingfixedpointproblems

Proximal algorithms and temporal difference methods for solving fixed point problems

Similar Items