📄 中文摘要
苏木精和伊红(H&E)染色图像中的细胞核实例分割在自动化组织病理图像分析中至关重要,并在诸多下游任务中发挥作用。尽管针对细胞核实例分割已提出多种机器学习和深度学习方法,但当前研究多侧重于开发新的分割算法,并在有限且随意选择的公开数据集上进行基准测试。这种做法导致了方法间的比较缺乏全面性和系统性,难以准确评估不同算法在多样化场景下的真实性能。为了解决这一局限性,NucFuseRank 提出了一种创新的数据集融合策略,将多个来源的细胞核实例分割数据集进行整合,构建一个规模更大、多样性更强、代表性更广的综合性数据集。
📄 English Summary
NucFuseRank: Dataset Fusion and Performance Ranking for Nuclei Instance Segmentation
Nuclei instance segmentation in hematoxylin and eosin (H&E)-stained images is pivotal for automated histological image analysis, with diverse applications in downstream tasks. While numerous machine learning and deep learning approaches have been proposed for nuclei instance segmentation, current research predominantly focuses on developing novel segmentation algorithms and benchmarking them on a limited number of arbitrarily selected public datasets. This practice often results in an incomplete and unsystematic comparison between methods, making it challenging to accurately assess the true performance of different algorithms across varied scenarios. To address this limitation, NucFuseRank introduces an innovative dataset fusion strategy, integrating multiple nuclei instance segmentation datasets from various sources to construct a larger, more diverse, and representative comprehensive dataset. This fused dataset aims to encompass the complexities of different tissue types, staining variations, image qualities, and nuclear morphologies, thereby providing a more robust foundation for evaluating algorithm generalizability. Building upon this, NucFuseRank further establishes a systematic performance evaluation framework and ranking mechanism. This framework not only considers traditional segmentation metrics (e.g., Dice coefficient, Jaccard index, average precision) but also incorporates new metrics tailored to the specific characteristics of nuclei instance segmentation, offering a more comprehensive assessment of an algorithm's capabilities in detection, segmentation accuracy, and topological structure preservation. By rigorously testing and ranking existing mainstream algorithms on the fused dataset, NucFuseRank aims to provide a trustworthy and reproducible benchmark, assisting researchers in identifying state-of-the-art methods, understanding their strengths and weaknesses, and guiding future algorithm development.