利用强化学习构建、合并、求解与适应的最小-最大多旅行商问题

出处: Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

发布: 2026年3月2日

📄 中文摘要

多旅行商问题（mTSP）是旅行商问题的扩展，涉及多个从共同仓库出发并返回的旅行路线，要求所有客户被访问一次且仅一次。在最小-最大变体中，目标是最小化最长的旅行路线，以反映工作负载的平衡。提出了一种混合方法，称为利用强化学习的构建、合并、求解与适应（RL-CMSA），用于对称单仓库的最小-最大mTSP。该方法通过学习的成对q值引导的概率聚类，迭代构建多样化的解决方案，将路线合并为紧凑的池，解决限制集覆盖的MILP问题，并通过跨路线的移除、移动和交换操作来优化解决方案。q值通过强化城市对共现的方式进行更新。

🏷️ 相关标签

#多旅行商问题 #强化学习 #最小-最大 #聚类 #MILP

📄 English Summary

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

The Multiple Traveling Salesman Problem (mTSP) extends the Traveling Salesman Problem to multiple tours that start and end at a common depot while visiting all customers exactly once. In the min-max variant, the objective is to minimize the longest tour, reflecting workload balance. A hybrid approach, termed Reinforcement Learning-based Construct, Merge, Solve & Adapt (RL-CMSA), is proposed for the symmetric single-depot min-max mTSP. This method iteratively constructs diverse solutions using probabilistic clustering guided by learned pairwise q-values, merges routes into a compact pool, solves a restricted set-covering MILP, and refines solutions through inter-route remove, shift, and swap moves. The q-values are updated by reinforcing city-pair co-occurrence.

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等

📄 中文摘要

🏷️ 相关标签

📄 English Summary

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

🏷️ Related Tags

📚 相关文章

AI 编程创造了新一类创作者。我就是其中之一。

人工智能成为我学习的助手

Claude CLI "泄露": 没有人赢，AI 仍然幻觉，企业仍在犯同样的错误