DAR:道义推理与智能体框架
阅读原文· arxiv.orgDAR(Deontic Agentic Reasoning)是一种让模型按需与法规交互的智能体推理设置,用于解决应用规则和策略回答具体事实问题的道义推理任务。在DeonticBench困难子集上的评估发现,智能体框架能推动道义推理的前沿性能,但改进并不均匀:较弱的模型在数值任务中表现退化,同时消耗更多模型token。
Deontic reasoning is the task of answering questions by applying explicit rules and policies to case-specific facts, for example computing tax liability under a statute or determining the outcome of an immigration appeal. A key technical challenge for LLM-based deontic reasoning is that the relevant ruleset can be long and cross-referenced, so models may still fail to locate the rules needed for a particular reasoning step. We introduce Deontic Agentic Reasoning (DAR), an agentic reasoning setup in which the model interacts with the statutes on demand. We evaluate DAR under multiple harnesses on hard subsets of DeonticBench. Across these settings, we find that agentic harnesses can push the frontier on deontic reasoning tasks, but improvements are not uniform: weaker models often degrade on numerical tasks while consuming far more tokens.