巴西登革热住院数据集(1999-2021):月度数据周粒度分解
📄 中文摘要
公布了巴西登革热住院数据集(v1.0.0),该数据集已发布在Zenodo平台,DOI为10.5281/zenodo.18189192。为了提高原始月度数据的时序粒度,进而更有效地训练用于流行病学预测的AI模型,该数据集对巴西全国范围内的市级登革热住院时间序列进行了统一处理,并将其从月度数据分解为周粒度(按流行病学周划分)。数据集涵盖了1999年至2021年期间的登革热住院病例数据,为研究人员和AI开发者提供了更精细的时间分辨率数据,以支持登革热传播模式的分析和预测模型的开发。通过将月度汇总数据转换为周度数据,可以捕获更细微的流行病学动态,这对于短期预测和早期预警系统至关重要。数据集的构建涉及对不同来源和格式的原始数据进行标准化和清洗,确保了数据的一致性和可靠性。数据集中包含的地理信息允许进行区域性分析,进一步揭示登革热在不同城市和地区间的传播差异。该数据集的发布旨在促进开放科学,鼓励更广泛的合作,共同应对登革热带来的公共卫生挑战,并为AI在公共卫生领域的应用提供高质量的数据基础。数据的可访问性将有助于加速登革热预测算法的创新和部署,从而提高疾病监测和控制的效率。
📄 English Summary
A Dataset of Dengue Hospitalizations in Brazil (1999 to 2021) with Weekly Disaggregation from Monthly Counts
A dataset (v1.0.0) of dengue hospitalizations in Brazil is publicly released, available on Zenodo under DOI 10.5281/zenodo.18189192. Driven by the necessity to enhance the temporal granularity of originally monthly data for more effective training of AI models in epidemiological forecasting, this dataset harmonizes municipal-level dengue hospitalization time series across Brazil and disaggregates them to weekly resolution (epidemiological weeks). The dataset spans the period from 1999 to 2021, offering researchers and AI developers finer temporal resolution data to support the analysis of dengue transmission patterns and the development of predictive models. Transforming monthly aggregated data into weekly data allows for the capture of more nuanced epidemiological dynamics, which is crucial for short-term forecasting and early warning systems. The construction of this dataset involved standardizing and cleaning raw data from various sources and formats, ensuring data consistency and reliability. The included geographical information enables regional analysis, further revealing differences in dengue transmission across various cities and regions. The release of this dataset aims to foster open science, encourage broader collaboration to address public health challenges posed by dengue, and provide a high-quality data foundation for AI applications in public health. Data accessibility will help accelerate the innovation and deployment of dengue prediction algorithms, thereby improving the efficiency of disease surveillance and control.
Powered by Cloudflare Workers + Payload CMS + Claude 3.5
数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等