滚动起源验证逆转多步PM10预测中的模型排名:XGBoost、SARIMA和持久性

📄 中文摘要

许多空气质量预测研究报告了机器学习的优势,但评估通常使用静态时间分割,并且省略了持久性基准,因此在常规更新下的操作附加值尚不明确。利用2017年至2024年间来自南欧城市背景监测站的2350个每日PM10观测数据,比较了在静态分割和每月更新的滚动起源协议下,XGBoost和SARIMA与持久性模型的表现。报告了特定时间范围的技能和可预测时间范围,后者定义为具有正持久性相对技能的最大时间范围。静态评估表明,XGBoost在一到七天的预测中表现良好,但滚动起源评估则逆转了排名:XGBoost的表现不如预期。

📄 English Summary

Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence

Many air quality forecasting studies report gains from machine learning, but evaluations often rely on static chronological splits and omit persistence baselines, leaving the operational added value under routine updates unclear. Using 2,350 daily PM10 observations from 2017 to 2024 at an urban background monitoring station in southern Europe, a comparison is made between XGBoost and SARIMA against persistence under both static splits and a rolling-origin protocol with monthly updates. Horizon-specific skill and the predictability horizon, defined as the maximum horizon with positive persistence-relative skill, are reported. Static evaluation suggests that XGBoost performs well from one to seven days ahead; however, rolling-origin evaluation reverses these rankings, indicating that XGBoost does not perform as expected.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等