利用强化学习构建、合并、求解与适应的最小-最大多旅行商问题
📄 中文摘要
多旅行商问题(mTSP)是旅行商问题的扩展,涉及多个从共同仓库出发并返回的旅行路线,要求所有客户被访问一次且仅一次。在最小-最大变体中,目标是最小化最长的旅行路线,以反映工作负载的平衡。提出了一种混合方法,称为利用强化学习的构建、合并、求解与适应(RL-CMSA),用于对称单仓库的最小-最大mTSP。该方法通过学习的成对q值引导的概率聚类,迭代构建多样化的解决方案,将路线合并为紧凑的池,解决限制集覆盖的MILP问题,并通过跨路线的移除、移动和交换操作来优化解决方案。q值通过强化城市对共现的方式进行更新。
📄 English Summary
Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem
The Multiple Traveling Salesman Problem (mTSP) extends the Traveling Salesman Problem to multiple tours that start and end at a common depot while visiting all customers exactly once. In the min-max variant, the objective is to minimize the longest tour, reflecting workload balance. A hybrid approach, termed Reinforcement Learning-based Construct, Merge, Solve & Adapt (RL-CMSA), is proposed for the symmetric single-depot min-max mTSP. This method iteratively constructs diverse solutions using probabilistic clustering guided by learned pairwise q-values, merges routes into a compact pool, solves a restricted set-covering MILP, and refines solutions through inter-route remove, shift, and swap moves. The q-values are updated by reinforcing city-pair co-occurrence.
Powered by Cloudflare Workers + Payload CMS + Claude 3.5
数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等