📄 English Summary
The lost-in-the-middle problem and why retrieval beats stuffing
Research indicates that language models tend to ignore information placed in the middle of long contexts. In 2023, researchers from Stanford, UC Berkeley, and Samaya AI published "Lost in the Middle: How Language Models Use Long Contexts," which measured model performance as a function of where the key information sat in the input. Models performed best when the crucial information appeared at the beginning or end of the context, while information in the middle was frequently missed. This was not an isolated finding: the results were consistent across multiple model families and context lengths, underscoring that where information sits in a long context matters, not just whether it is present.
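The evaluation style the paper describes can be sketched in a few lines: place a "gold" passage at varying positions among distractor passages, then query the model with the same question for each variant and compare accuracy by position. The passages and the launch-code fact below are hypothetical placeholders, not data from the paper.

```python
# Sketch of a positional probe in the spirit of lost-in-the-middle
# evaluations. All passages here are hypothetical placeholders.

def build_context(gold: str, distractors: list[str], position: int) -> str:
    """Insert the gold passage at `position` among the distractors
    and format the documents the way they would appear in a prompt."""
    docs = distractors[:position] + [gold] + distractors[position:]
    return "\n\n".join(f"Document [{i + 1}]: {d}" for i, d in enumerate(docs))

gold = "The launch code is 7421."  # the fact the model must retrieve
distractors = [f"Filler passage number {i}." for i in range(9)]

# One probe per position: beginning, middle, and end of a 10-document context.
for pos in (0, len(distractors) // 2, len(distractors)):
    ctx = build_context(gold, distractors, pos)
    # Each ctx would be sent to the model with the same question;
    # accuracy is then compared across positions.
```

Holding the question and document set fixed while moving only the gold passage isolates position as the sole variable, which is what makes the U-shaped accuracy curve attributable to position rather than content.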
Powered by Cloudflare Workers + Payload CMS + Claude 3.5
Data sources: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace, and others