My LLM Started Lying to My App and I Didn't Notice for Three Days

📄 Summary

My LLM Started Lying to My App and I Didn't Notice for Three Days

A user flagged on Slack that the summaries looked off. Investigation showed the model had been returning malformed JSON on roughly 12% of requests for 72 hours. The error handler swallowed the parse failures and fell back to the previous day's output, so users were served stale data labeled as current. No monitoring caught the problem because no exception was ever raised: the model had quietly drifted from format instructions it had followed reliably for months, with no change to the model version or the prompt.
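The failure mode described above can be sketched in a few lines. This is a hypothetical reconstruction, not the author's actual code: the function name `parseSummary`, the `Summary` shape, and the `parseFailures` counter are all illustrative. The point is that a bare `catch` that returns cached data raises no exception and leaves no signal, whereas counting the failures gives a monitor something to alert on.

```typescript
// Illustrative sketch of the silent-fallback anti-pattern and a minimal fix.
type Summary = { date: string; text: string };

// The metric the original pipeline lacked: how often parsing fails.
let parseFailures = 0;

function parseSummary(raw: string, cached: Summary | null): Summary | null {
  try {
    return JSON.parse(raw) as Summary;
  } catch {
    // Anti-pattern: returning `cached` here with no record of the failure
    // serves yesterday's data as today's, and nothing ever throws.
    // Minimal fix: count (or log/emit) the failure before falling back,
    // so a 12% malformed-JSON rate becomes visible instead of invisible.
    parseFailures += 1;
    return cached;
  }
}
```

Alerting on `parseFailures` as a rate (e.g. failures per total requests over a window) would have surfaced this drift within minutes rather than three days.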

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

Data sources: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace, etc.