Generative inverse reinforcement learning for learning 2-opt heuristics without extrinsic rewards in routing problems

Deep reinforcement learning (DRL) has shown promise in solving challenging combinatorial optimization (CO) problems, such as the traveling salesman problem (TSP) and vehicle routing problem (VRP). However, existing DRL methods rely on manually designed reward functions, which may be inaccurate or un...

Olles dieđut

Bibliográfalaš dieđut
Váldodahkkit: Qi Wang, Yongsheng Hao, Jiawei Zhang
Materiálatiipa: Artihkal
Giella:English
Almmustuhtton: Elsevier 2023-10-01
Ráidu:Journal of King Saud University: Computer and Information Sciences
Fáttát:
Liŋkkat:http://www.sciencedirect.com/science/article/pii/S1319157823003415

Geahča maid