用于视觉性别分类的解释性互动机器学习偏见缓解

出处: Explanatory Interactive Machine Learning for Bias Mitigation in Visual Gender Classification

发布: 2026年2月17日

📄 中文摘要

解释性互动学习（XIL）使用户能够通过对模型解释的反馈来引导机器学习（ML）中的模型训练，从而帮助模型关注用户视角下与预测相关的特征。本研究探索了这一学习范式在缓解视觉分类器中的偏见和虚假相关性方面的能力，特别是在性别分类等易受数据偏见影响的场景中。研究考察了两种方法论上不同的最先进的XIL策略，即CAIPI和“正确理由的正确性”（RRR），以及一种结合这两种策略的新型混合方法。通过比较分割掩膜与解释的结果，定量评估了这些方法的效果。

🏷️ 相关标签

#解释性互动学习 #偏见缓解 #视觉分类 #性别分类 #机器学习

📄 English Summary

Explanatory Interactive Machine Learning for Bias Mitigation in Visual Gender Classification

Explanatory interactive learning (XIL) allows users to guide model training in machine learning (ML) by providing feedback on model explanations, thus helping the model focus on features relevant to predictions from the user's perspective. This study explores the capability of this learning paradigm to mitigate bias and spurious correlations in visual classifiers, particularly in data-biased scenarios such as gender classification. Two methodologically distinct state-of-the-art XIL strategies, namely CAIPI and Right for the Right Reasons (RRR), are investigated, along with a novel hybrid approach that combines both strategies. The results are quantitatively evaluated by comparing segmentation masks with explanations.

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等

📄 中文摘要

🏷️ 相关标签

📄 English Summary

Explanatory Interactive Machine Learning for Bias Mitigation in Visual Gender Classification

🏷️ Related Tags

📚 相关文章

AI 编程创造了新一类创作者。我就是其中之一。

人工智能成为我学习的助手

Claude CLI "泄露": 没有人赢，AI 仍然幻觉，企业仍在犯同样的错误