Generative inverse reinforcement learning for learning 2-opt heuristics without extrinsic rewards in routing problems
Deep reinforcement learning (DRL) has shown promise in solving challenging combinatorial optimization (CO) problems, such as the traveling salesman problem (TSP) and vehicle routing problem (VRP). However, existing DRL methods rely on manually designed reward functions, which may be inaccurate or un...
Váldodahkkit: | Qi Wang, Yongsheng Hao, Jiawei Zhang |
---|---|
Materiálatiipa: | Artihkal |
Giella: | English |
Almmustuhtton: |
Elsevier
2023-10-01
|
Ráidu: | Journal of King Saud University: Computer and Information Sciences |
Fáttát: | |
Liŋkkat: | http://www.sciencedirect.com/science/article/pii/S1319157823003415 |
Geahča maid
-
Discovering Lin-Kernighan-Helsgaun heuristic for routing optimization using self-supervised reinforcement learning
Dahkki: Qi Wang, et al.
Almmustuhtton: (2023-09-01) -
A review of reinforcement learning based hyper-heuristics
Dahkki: Cuixia Li, et al.
Almmustuhtton: (2024-06-01) -
A neighborhood search-based heuristic for the dynamic vehicle routing problem
Dahkki: Wilton Gustavo Gomes da Costa, et al.
Almmustuhtton: (2025-01-01) -
Generative Adversarial Inverse Reinforcement Learning With Deep Deterministic Policy Gradient
Dahkki: Ming Zhan, et al.
Almmustuhtton: (2023-01-01) -
Improving
acceptability of nudges: Learning from attitudes towards opt-in and opt-out
policies
Dahkki: Haoyang Yan, et al.
Almmustuhtton: (2019-01-01)