📄 中文摘要
提出了一种名为AttentionMixer的统一深度学习框架,用于多模态脑水肿检测,该框架结合了结构性头部CT(HCT)和常规临床元数据。HCT提供了丰富的空间信息,而临床变量如年龄、实验室数值和扫描时间则捕捉了可能被忽视或简单拼接的互补上下文。AttentionMixer旨在以原则性和高效的方式融合这些异构数据源。HCT体积首先通过自监督视觉变换器自编码器(ViT-AE++)进行编码,无需大量标注数据集。临床元数据被映射到相同的特征空间,并作为交叉注意力模块中的键和值,HCT衍生的特征向量则服务于该模块的功能。
📄 English Summary
A Multimodal Deep Learning Framework for Edema Classification Using HCT and Clinical Data
AttentionMixer is proposed as a unified deep learning framework for the multimodal detection of brain edema, integrating structural head CT (HCT) with routine clinical metadata. While HCT provides rich spatial information, clinical variables such as age, laboratory values, and scan timing capture complementary context that may be overlooked or naively concatenated. AttentionMixer is designed to fuse these heterogeneous sources in a principled and efficient manner. HCT volumes are first encoded using a self-supervised Vision Transformer Autoencoder (ViT-AE++), eliminating the need for large labeled datasets. Clinical metadata are mapped into the same feature space and utilized as keys and values in a cross-attention module, where the HCT-derived feature vector serves a crucial role.
Powered by Cloudflare Workers + Payload CMS + Claude 3.5
数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等