FactorSmith:通过马尔可夫决策过程分解生成自主模拟

📄 中文摘要

生成可执行的模拟程序从自然语言规范中提取仍然是一个具有挑战性的问题,尤其是在面对大型、互联的代码库时,大型语言模型(LLMs)的推理能力受到限制。FactorSmith框架通过结合两个互补的概念,提出了一种从文本描述合成可玩游戏模拟的解决方案:一是基于分解的部分可观测马尔可夫决策过程(POMDP)进行原则性上下文缩减,二是采用分层的规划者-设计者-评论者的自主工作流程,在每个生成步骤中进行迭代质量改进。该方法借鉴了FactorSim提出的分解POMDP表示,旨在有效地处理模拟规范的复杂性。通过这种方法,可以提高生成模拟的质量和可玩性。

📄 English Summary

FactorSmith: Agentic Simulation Generation via Markov Decision Process Decomposition with Planner-Designer-Critic Refinement

Generating executable simulations from natural language specifications poses significant challenges due to the limited reasoning capabilities of large language models (LLMs) when dealing with extensive interconnected codebases. FactorSmith presents a framework that synthesizes playable game simulations in code from textual descriptions by integrating two complementary concepts: factored POMDP decomposition for principled context reduction, and a hierarchical planner-designer-critic agentic workflow for iterative quality refinement at each generation step. Drawing from the factored partially observable Markov decision process (POMDP) representation introduced by FactorSim, the proposed method effectively decomposes simulation specifications to manage their complexity. This approach aims to enhance the quality and playability of the generated simulations.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等