滚动起源验证逆转多步PM10预测中的模型排名：XGBoost、SARIMA和持久性

出处: Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence

发布: 2026年3月24日

📄 中文摘要

许多空气质量预测研究报告了机器学习的优势，但评估通常使用静态时间分割，并且省略了持久性基准，因此在常规更新下的操作附加值尚不明确。利用2017年至2024年间来自南欧城市背景监测站的2350个每日PM10观测数据，比较了在静态分割和每月更新的滚动起源协议下，XGBoost和SARIMA与持久性模型的表现。报告了特定时间范围的技能和可预测时间范围，后者定义为具有正持久性相对技能的最大时间范围。静态评估表明，XGBoost在一到七天的预测中表现良好，但滚动起源评估则逆转了排名：XGBoost的表现不如预期。

🏷️ 相关标签

#PM10预测 #机器学习 #XGBoost #SARIMA #空气质量

📄 English Summary

Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence

Many air quality forecasting studies report gains from machine learning, but evaluations often rely on static chronological splits and omit persistence baselines, leaving the operational added value under routine updates unclear. Using 2,350 daily PM10 observations from 2017 to 2024 at an urban background monitoring station in southern Europe, a comparison is made between XGBoost and SARIMA against persistence under both static splits and a rolling-origin protocol with monthly updates. Horizon-specific skill and the predictability horizon, defined as the maximum horizon with positive persistence-relative skill, are reported. Static evaluation suggests that XGBoost performs well from one to seven days ahead; however, rolling-origin evaluation reverses these rankings, indicating that XGBoost does not perform as expected.

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等

📄 中文摘要

🏷️ 相关标签

📄 English Summary

Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence

🏷️ Related Tags

📚 相关文章

AI 编程创造了新一类创作者。我就是其中之一。

人工智能成为我学习的助手

Claude CLI "泄露": 没有人赢，AI 仍然幻觉，企业仍在犯同样的错误