AURORA: 自适应统一表示用于鲁棒超声分析

📄 中文摘要

超声图像因扫描仪、操作员和解剖目标的不同而存在显著差异,这导致在一种环境中训练的模型在新医院和临床条件下的泛化能力较差。超声图像分析的基础模型挑战(FMC-UIA)反映了这一困难,要求单一模型处理多种任务,包括分割、检测、分类和地标回归,涵盖多种器官和数据集。提出了一种基于Qwen3-VL系列的变换器视觉编码器的统一多任务框架。中间标记特征被投影到空间特征图中,并通过轻量级多尺度特征金字塔进行融合,从而实现像素级预测和全局推理。该框架旨在提高超声图像分析的鲁棒性和适应性。

📄 English Summary

AURORA: Adaptive Unified Representation for Robust Ultrasound Analysis

Ultrasound images exhibit significant variations across scanners, operators, and anatomical targets, often leading to poor generalization of models trained in one setting to new hospitals and clinical conditions. The Foundation Model Challenge for Ultrasound Image Analysis (FMC-UIA) highlights this challenge by requiring a single model to perform multiple tasks, including segmentation, detection, classification, and landmark regression across diverse organs and datasets. A unified multi-task framework is proposed, based on a transformer visual encoder from the Qwen3-VL family. Intermediate token features are projected into spatial feature maps and fused using a lightweight multi-scale feature pyramid, enabling both pixel-level predictions and global reasoning. This framework aims to enhance the robustness and adaptability of ultrasound image analysis.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等