INSURE-Dial:一种面向阶段的对话数据集与合规性验证基准
📄 中文摘要
INSURE-Dial 是首个公开的基准,旨在开发和评估合规意识的语音代理,以实现阶段感知的电话审核和基于跨度的合规性验证。该数据集包含 50 个去标识化的 AI 发起的电话,与现场保险代表进行交流(每通电话平均 71 次对话轮次),以及 1,000 个合成生成的电话,模拟相同的工作流程。所有电话均使用阶段结构的 JSON 模式进行注释,涵盖 IVR 导航、患者识别、保险覆盖状态、药物检查(最多两种药物)和代理识别(CRN)。
📄 English Summary
INSURE-Dial: A Phase-Aware Conversational Dataset \& Benchmark for Compliance Verification and Phase Detection
INSURE-Dial is the first public benchmark designed for the development and assessment of compliance-aware voice agents aimed at phase-aware call auditing and span-based compliance verification. The dataset comprises 50 de-identified, AI-initiated calls with live insurance representatives, averaging 71 turns per call, and 1,000 synthetically generated calls that mirror the same workflow. All calls are annotated with a phase-structured JSON schema that covers IVR navigation, patient identification, coverage status, medication checks (up to two drugs), and agent identification (CRN).
Powered by Cloudflare Workers + Payload CMS + Claude 3.5
数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等