NOTAI.AI:基于曲率和特征归因的机器生成文本可解释检测
📄 中文摘要
NOTAI.AI 是一个可解释的机器生成文本检测框架,通过将基于曲率的信号与神经和风格特征结合,扩展了 Fast-DetectGPT 的功能。该系统在监督学习的环境下,结合了 17 种可解释特征,包括条件概率曲率、ModernBERT 检测器得分、可读性指标和风格线索,使用梯度提升树(XGBoost)元分类器来判断文本是人类生成还是 AI 生成。此外,NOTAI.AI 采用 Shapley 加性解释(SHAP)方法提供局部和全局特征级别的归因。这些归因进一步通过基于大型语言模型的解释层转化为结构化的自然语言理由,增强了可解释性。
📄 English Summary
NOTAI.AI: Explainable Detection of Machine-Generated Text via Curvature and Feature Attribution
NOTAI.AI is an explainable framework for detecting machine-generated text that enhances Fast-DetectGPT by integrating curvature-based signals with neural and stylometric features in a supervised setting. The system combines 17 interpretable features, including Conditional Probability Curvature, ModernBERT detector score, readability metrics, and stylometric cues, within a gradient-boosted tree (XGBoost) meta-classifier to determine whether a text is human- or AI-generated. Furthermore, NOTAI.AI employs Shapley Additive Explanations (SHAP) to provide both local and global feature-level attributions. These attributions are further translated into structured natural-language rationales through an LLM-based explanation layer, enhancing interpretability.
Powered by Cloudflare Workers + Payload CMS + Claude 3.5
数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等