Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games

© 2018 AACC. Zero-sum differential games constitute a prominent research topic in several fields ranging from economics to motion planning. Unfortunately, analytical techniques for differential games can address only simple, illustrative problem instances, and most existing computational methods suf...

Full description

Bibliographic Details
Main Authors: Tal, Ezra, Gorodetsky, Alex, Karaman, Sertac
Other Authors: Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Format: Article
Language:English
Published: IEEE 2021
Online Access:https://hdl.handle.net/1721.1/137851
_version_ 1826212089549553664
author Tal, Ezra
Gorodetsky, Alex
Karaman, Sertac
author2 Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
author_facet Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Tal, Ezra
Gorodetsky, Alex
Karaman, Sertac
author_sort Tal, Ezra
collection MIT
description © 2018 AACC. Zero-sum differential games constitute a prominent research topic in several fields ranging from economics to motion planning. Unfortunately, analytical techniques for differential games can address only simple, illustrative problem instances, and most existing computational methods suffer from the curse of dimensionality, i.e., the computational requirements grow exponentially with the dimensionality of the state space. In order to alleviate the curse of dimensionality for a certain class of two-player pursuit-evasion games, we propose a novel dynamic-programming-based algorithm that uses a continuous tensor-train approximation to represent the value function. In this way, the algorithm can represent high-dimensional tensors using computational resources that grow only polynomially with dimensionality of the state space and with the rank of the value function. The proposed algorithm is shown to converge to optimal solutions. It is demonstrated in several problem instances; in case of a seven-dimensional game, the value function representation was obtained with seven orders of magnitude savings in computational and memory cost, when compared to standard value iteration.
first_indexed 2024-09-23T15:16:13Z
format Article
id mit-1721.1/137851
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T15:16:13Z
publishDate 2021
publisher IEEE
record_format dspace
spelling mit-1721.1/1378512023-02-09T21:38:29Z Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games Tal, Ezra Gorodetsky, Alex Karaman, Sertac Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Massachusetts Institute of Technology. Laboratory for Information and Decision Systems © 2018 AACC. Zero-sum differential games constitute a prominent research topic in several fields ranging from economics to motion planning. Unfortunately, analytical techniques for differential games can address only simple, illustrative problem instances, and most existing computational methods suffer from the curse of dimensionality, i.e., the computational requirements grow exponentially with the dimensionality of the state space. In order to alleviate the curse of dimensionality for a certain class of two-player pursuit-evasion games, we propose a novel dynamic-programming-based algorithm that uses a continuous tensor-train approximation to represent the value function. In this way, the algorithm can represent high-dimensional tensors using computational resources that grow only polynomially with dimensionality of the state space and with the rank of the value function. The proposed algorithm is shown to converge to optimal solutions. It is demonstrated in several problem instances; in case of a seven-dimensional game, the value function representation was obtained with seven orders of magnitude savings in computational and memory cost, when compared to standard value iteration. 2021-11-09T13:15:12Z 2021-11-09T13:15:12Z 2018-06 2019-10-28T18:31:10Z Article http://purl.org/eprint/type/ConferencePaper https://hdl.handle.net/1721.1/137851 Tal, Ezra, Gorodetsky, Alex and Karaman, Sertac. 2018. "Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games." en 10.23919/acc.2018.8431472 Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf IEEE website
spellingShingle Tal, Ezra
Gorodetsky, Alex
Karaman, Sertac
Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_full Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_fullStr Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_full_unstemmed Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_short Continuous Tensor Train-Based Dynamic Programming for High-Dimensional Zero-Sum Differential Games
title_sort continuous tensor train based dynamic programming for high dimensional zero sum differential games
url https://hdl.handle.net/1721.1/137851
work_keys_str_mv AT talezra continuoustensortrainbaseddynamicprogrammingforhighdimensionalzerosumdifferentialgames
AT gorodetskyalex continuoustensortrainbaseddynamicprogrammingforhighdimensionalzerosumdifferentialgames
AT karamansertac continuoustensortrainbaseddynamicprogrammingforhighdimensionalzerosumdifferentialgames