RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Source: RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Published: March 30, 2026

🏷️ Related Tags

#ChartGeneration #VisionLanguageModels #MultiTaskEvaluation #RealData #Benchmarking

📄 English Summary

RealChart2Code is a new large-scale benchmark comprising over 2,800 instances grounded in authentic datasets with tasks that have clear analytical intent. It is the first benchmark to systematically evaluate chart generation from large-scale raw data and to assess iterative code refinement in a multi-turn conversational setting. A comprehensive evaluation of 14 leading Vision-Language Models (VLMs) on RealChart2Code reveals significant performance degradation compared to traditional methods. This research establishes a new evaluation standard for the field of chart generation, advancing the development of chart generation technologies.
