论文标题
在混杂的奖励下进行顺序反事实决策
Sequential Counterfactual Decision-Making Under Confounded Reward
论文作者
论文摘要
当利益因素与效果混淆时,我们研究了随机试验的局限性,即通过对代理人的自然偏爱输入软干预的反事实策略空间进行正式的效果。
We investigate the limitations of random trials when the cause of interest is confounded with the effect by formalizing a counterfactual policy-space where the agent's natural predilection is input to a soft-intervention.