The current SOTA model was released without safety evals

OpenAI released GPT-5.4 Thinking and GPT-5.4 Pro on March 5, 2026. GPT-5.4 Pro is considered the best model in the world on many tasks relevant to catastrophic risk, including biological research and development, orchestrating cyberoffense operations, and computer use. It has no system card and, to the best of our knowledge, was released without any safety evaluations. The same thing has happened at least once before, with GPT-5.2 Pro. The post also offers recommendations on how teams could conduct fast, independent risk assessments of models after deployment.
