更大利益攸关:报酬与语言如何影响大型语言模型在合作困境中的策略

📄 中文摘要

大型语言模型(LLM)作为交互式多智能体环境中的自主代理,其战略行为对于人工智能系统的安全性、协调性以及社会和经济影响至关重要。研究通过使用报酬缩放的囚徒困境,系统性地探究了报酬大小和语言情境如何塑造LLM在重复社会困境中的策略,旨在独立分析激励强度对LLM行为敏感性的影响。实验跨越多种LLM模型和不同自然语言,结果显示出一致的行为模式。观察到LLM的合作倾向并非固定不变,而是显著受到报酬结构的影响,高额报酬通常能驱动更强的合作意愿。同时,语言的细微差别,例如指令的措辞和情境描述,也对LLM的决策过程产生可量化的影响,表明LLM并非简单地执行预设规则,而是能够对语言提示进行深度理解和情境化推理。这些发现强调了在设计和部署基于LLM的智能体时,需要细致考虑其所处经济激励框架和沟通方式,以确保其行为符合预期并促进多智能体系统中的合作与效率。理解这些因素有助于开发更可靠、更易于管理的LLM驱动系统,特别是在涉及高风险决策或复杂社会交互的场景中。

📄 English Summary

More at Stake: How Payoff and Language Shape LLM Agent Strategies in Cooperation Dilemmas

As Large Language Models (LLMs) increasingly function as autonomous agents in interactive and multi-agent settings, comprehending their strategic behavior is paramount for ensuring the safety, coordination, and efficacy of AI-driven social and economic systems. This investigation systematically examines how the magnitude of payoffs and the surrounding linguistic context influence LLM strategies within repeated social dilemmas. Utilizing a payoff-scaled Prisoner's Dilemma, the study meticulously isolates and quantifies the LLMs' sensitivity to varying incentive strengths. Consistent behavioral patterns are observed across a diverse range of LLM models and different natural languages. The findings indicate that LLMs' propensity for cooperation is not static but significantly modulated by the payoff structure, with higher stakes generally correlating with a stronger inclination towards cooperative actions. Furthermore, subtle linguistic nuances, including the precise phrasing of instructions and contextual descriptions, exert measurable effects on LLM decision-making processes. This suggests that LLMs do not merely execute predefined rules but engage in sophisticated comprehension and contextual reasoning based on linguistic prompts. These insights underscore the critical need to meticulously consider both the economic incentive frameworks and communication modalities when designing and deploying LLM-based agents. Such considerations are essential for ensuring predictable behavior and fostering cooperation and efficiency within multi-agent systems. A deeper understanding of these factors will facilitate the development of more robust and manageable LLM-driven systems, particularly in scenarios involving high-stakes decisions or intricate social interactions.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等