论文标题
规范:多语言多元文化规范从直接的对话中发现
NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly
论文作者
论文摘要
规范发现对于理解和推理人类交流和互动中的潜在侵犯和推理很重要。我们介绍了规范,这是一个基于语言模型的提示和自我验证的框架,用于解决对话基础的多语言,多文化规范发现的新任务。规范利用了预验证的GPT-3语言模型骨干的表达和隐性知识,通过代表规范发现任务和对话环境的定向问题来引起有关规范的知识。它进一步解决了语言模型幻觉的风险,并通过自我验证机制确保发现的规范是正确的,并且基本上是基于其源对话的。评估结果表明,与基准相比,我们的方法可以发现对话的相关性和有见地的规范(> 10+%的李克特量表等级)。从中国对话中发现的规范也可以与从英语对话中发现的洞察力和正确性相媲美(差异<3%)。此外,特定于文化的规范是有希望的质量,可以在人类鉴定中获得80%的精度。最后,我们可以扩展我们在规范发现自我验证中的基础过程,以实例化与给定的对话的依从性和违反任何规范,并具有解释性和透明度。规范在接地上达到了95.4%的AUC,自然语言解释与人写的质量相匹配。
Norm discovery is important for understanding and reasoning about the acceptable behaviors and potential violations in human communication and interactions. We introduce NormSage, a framework for addressing the novel task of conversation-grounded multi-lingual, multi-cultural norm discovery, based on language model prompting and self-verification. NormSAGE leverages the expressiveness and implicit knowledge of the pretrained GPT-3 language model backbone, to elicit knowledge about norms through directed questions representing the norm discovery task and conversation context. It further addresses the risk of language model hallucination with a self-verification mechanism ensuring that the norms discovered are correct and are substantially grounded to their source conversations. Evaluation results show that our approach discovers significantly more relevant and insightful norms for conversations on-the-fly compared to baselines (>10+% in Likert scale rating). The norms discovered from Chinese conversation are also comparable to the norms discovered from English conversation in terms of insightfulness and correctness (<3% difference). In addition, the culture-specific norms are promising quality, allowing for 80% accuracy in culture pair human identification. Finally, our grounding process in norm discovery self-verification can be extended for instantiating the adherence and violation of any norm for a given conversation on-the-fly, with explainability and transparency. NormSAGE achieves an AUC of 95.4% in grounding, with natural language explanation matching human-written quality.