FactorSmith：通过马尔可夫决策过程分解生成自主模拟

出处: FactorSmith: Agentic Simulation Generation via Markov Decision Process Decomposition with Planner-Designer-Critic Refinement

发布: 2026年3月24日

📄 中文摘要

生成可执行的模拟程序从自然语言规范中提取仍然是一个具有挑战性的问题，尤其是在面对大型、互联的代码库时，大型语言模型（LLMs）的推理能力受到限制。FactorSmith框架通过结合两个互补的概念，提出了一种从文本描述合成可玩游戏模拟的解决方案：一是基于分解的部分可观测马尔可夫决策过程（POMDP）进行原则性上下文缩减，二是采用分层的规划者-设计者-评论者的自主工作流程，在每个生成步骤中进行迭代质量改进。该方法借鉴了FactorSim提出的分解POMDP表示，旨在有效地处理模拟规范的复杂性。通过这种方法，可以提高生成模拟的质量和可玩性。

🏷️ 相关标签

#自主模拟 #马尔可夫决策过程 #分层规划 #游戏模拟 #自然语言处理

📄 English Summary

FactorSmith: Agentic Simulation Generation via Markov Decision Process Decomposition with Planner-Designer-Critic Refinement

Generating executable simulations from natural language specifications poses significant challenges due to the limited reasoning capabilities of large language models (LLMs) when dealing with extensive interconnected codebases. FactorSmith presents a framework that synthesizes playable game simulations in code from textual descriptions by integrating two complementary concepts: factored POMDP decomposition for principled context reduction, and a hierarchical planner-designer-critic agentic workflow for iterative quality refinement at each generation step. Drawing from the factored partially observable Markov decision process (POMDP) representation introduced by FactorSim, the proposed method effectively decomposes simulation specifications to manage their complexity. This approach aims to enhance the quality and playability of the generated simulations.

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等

📄 中文摘要

🏷️ 相关标签

📄 English Summary

FactorSmith: Agentic Simulation Generation via Markov Decision Process Decomposition with Planner-Designer-Critic Refinement

🏷️ Related Tags

📚 相关文章

AI 编程创造了新一类创作者。我就是其中之一。

人工智能成为我学习的助手

Claude CLI "泄露": 没有人赢，AI 仍然幻觉，企业仍在犯同样的错误