Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games

© 2018 AACC. Zero-sum differential games constitute a prominent research topic in several fields ranging from economics to motion planning. Unfortunately, analytical techniques for differential games can address only simple, illustrative problem instances, and most existing computational methods suf...

Full description

Bibliographic Details
Main Authors:	Tal, Ezra, Gorodetsky, Alex, Karaman, Sertac
Other Authors:	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Format:	Article
Language:	English
Published:	IEEE 2021
Online Access:	https://hdl.handle.net/1721.1/137851

_version_	1826212089549553664
author	Tal, Ezra Gorodetsky, Alex Karaman, Sertac
author2	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
author_facet	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Tal, Ezra Gorodetsky, Alex Karaman, Sertac
author_sort	Tal, Ezra
collection	MIT
description	© 2018 AACC. Zero-sum differential games constitute a prominent research topic in several fields ranging from economics to motion planning. Unfortunately, analytical techniques for differential games can address only simple, illustrative problem instances, and most existing computational methods suffer from the curse of dimensionality, i.e., the computational requirements grow exponentially with the dimensionality of the state space. In order to alleviate the curse of dimensionality for a certain class of two-player pursuit-evasion games, we propose a novel dynamic-programming-based algorithm that uses a continuous tensor-train approximation to represent the value function. In this way, the algorithm can represent high-dimensional tensors using computational resources that grow only polynomially with dimensionality of the state space and with the rank of the value function. The proposed algorithm is shown to converge to optimal solutions. It is demonstrated in several problem instances; in case of a seven-dimensional game, the value function representation was obtained with seven orders of magnitude savings in computational and memory cost, when compared to standard value iteration.
first_indexed	2024-09-23T15:16:13Z
format	Article
id	mit-1721.1/137851
institution	Massachusetts Institute of Technology
language	English
last_indexed	2024-09-23T15:16:13Z
publishDate	2021
publisher	IEEE
record_format	dspace
spelling	mit-1721.1/1378512023-02-09T21:38:29Z Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games Tal, Ezra Gorodetsky, Alex Karaman, Sertac Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Massachusetts Institute of Technology. Laboratory for Information and Decision Systems © 2018 AACC. Zero-sum differential games constitute a prominent research topic in several fields ranging from economics to motion planning. Unfortunately, analytical techniques for differential games can address only simple, illustrative problem instances, and most existing computational methods suffer from the curse of dimensionality, i.e., the computational requirements grow exponentially with the dimensionality of the state space. In order to alleviate the curse of dimensionality for a certain class of two-player pursuit-evasion games, we propose a novel dynamic-programming-based algorithm that uses a continuous tensor-train approximation to represent the value function. In this way, the algorithm can represent high-dimensional tensors using computational resources that grow only polynomially with dimensionality of the state space and with the rank of the value function. The proposed algorithm is shown to converge to optimal solutions. It is demonstrated in several problem instances; in case of a seven-dimensional game, the value function representation was obtained with seven orders of magnitude savings in computational and memory cost, when compared to standard value iteration. 2021-11-09T13:15:12Z 2021-11-09T13:15:12Z 2018-06 2019-10-28T18:31:10Z Article http://purl.org/eprint/type/ConferencePaper https://hdl.handle.net/1721.1/137851 Tal, Ezra, Gorodetsky, Alex and Karaman, Sertac. 2018. "Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games." en 10.23919/acc.2018.8431472 Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf IEEE website
spellingShingle	Tal, Ezra Gorodetsky, Alex Karaman, Sertac Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title	Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_full	Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_fullStr	Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_full_unstemmed	Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_short	Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_sort	continuous tensor train based dynamic programming for high dimensional zero sum differential games
url	https://hdl.handle.net/1721.1/137851
work_keys_str_mv	AT talezra continuoustensortrainbaseddynamicprogrammingforhighdimensionalzerosumdifferentialgames AT gorodetskyalex continuoustensortrainbaseddynamicprogrammingforhighdimensionalzerosumdifferentialgames AT karamansertac continuoustensortrainbaseddynamicprogrammingforhighdimensionalzerosumdifferentialgames

Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games

Similar Items