ARC-AGI-2 技术报告

出处: ARC-AGI-2 Technical Report

发布: 2026年3月10日

📄 中文摘要

抽象与推理语料库(ARC)旨在评估超越模式匹配的泛化能力,要求模型从极少的示例中推断符号规则。该研究提出了一种基于变换器的系统,通过结合神经推理、结构感知先验和在线任务适应,提升了ARC的性能。首先,将ARC推理重新表述为序列建模问题,采用仅125个标记的紧凑任务编码,实现了使用修改后的LongT5架构的高效长上下文处理。其次,提出了一种基于群对称性、网格遍历和自动机扰动的原则性增强框架,确保对表示变化的不变性。通过这些创新,系统在处理复杂推理任务时表现出更强的适应能力和泛化能力。

📄 English Summary

ARC-AGI-2 Technical Report

The Abstraction and Reasoning Corpus (ARC) is designed to assess generalization beyond pattern matching, requiring models to infer symbolic rules from very few examples. A transformer-based system is proposed that enhances ARC performance by integrating neural inference with structure-aware priors and online task adaptation. The approach reformulates ARC reasoning as a sequence modeling problem using a compact task encoding of only 125 tokens, enabling efficient long-context processing with a modified LongT5 architecture. Additionally, a principled augmentation framework is introduced based on group symmetries, grid traversals, and automata perturbations, enforcing invariance to representation change. These innovations demonstrate improved adaptability and generalization capabilities in tackling complex reasoning tasks.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等