Dynamic sub-route-based self-adaptive beam search Q-learning algorithm for traveling salesman problem.

In this paper, a dynamic sub-route-based self-adaptive beam search Q-learning (DSRABSQL) algorithm is proposed that provides a reinforcement learning (RL) framework combined with local search to solve the traveling salesman problem (TSP). DSRABSQL builds upon the Q-learning (QL) algorithm. Consideri...

Full description

Bibliographic Details
Main Authors: Jin Zhang, Qing Liu, XiaoHang Han
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0283207