论文标题
通过随机优势比较两个样本:一种图形方法
Comparing Two Samples Through Stochastic Dominance: A Graphical Approach
论文作者
论文摘要
在现实世界中,非确定性的测量很常见:随机优化算法的性能或在混乱环境中加强学习代理的总奖励只是两个例子,其中不可预测的结果很常见。这些度量可以建模为随机变量,并通过其预期值或更复杂的工具(例如原假设统计检验)相互比较。在本文中,我们提出了一个替代框架,根据它们的估计分布函数在视觉上比较两个样本。首先,我们引入了两个随机变量的优势度量,该变量量化了一个随机变量之一随机变量的累积分布函数随机占据另一个变量的比例。然后,我们提出了一种在分位数中分解的图形方法i)提出的优势度量和ii)一个随机变量之一比另一个变量较低的值的概率。出于说明性目的,我们通过提出的方法重新评估了已经发表的工作的实验,我们表明可以推断出其他结论(通过其余方法错过)。此外,将软件包rvCompare创建为应用和实验所提出的框架的一种方便方式。
Non-deterministic measurements are common in real-world scenarios: the performance of a stochastic optimization algorithm or the total reward of a reinforcement learning agent in a chaotic environment are just two examples in which unpredictable outcomes are common. These measures can be modeled as random variables and compared among each other via their expected values or more sophisticated tools such as null hypothesis statistical tests. In this paper, we propose an alternative framework to visually compare two samples according to their estimated cumulative distribution functions. First, we introduce a dominance measure for two random variables that quantifies the proportion in which the cumulative distribution function of one of the random variables stochastically dominates the other one. Then, we present a graphical method that decomposes in quantiles i) the proposed dominance measure and ii) the probability that one of the random variables takes lower values than the other. With illustrative purposes, we re-evaluate the experimentation of an already published work with the proposed methodology and we show that additional conclusions (missed by the rest of the methods) can be inferred. Additionally, the software package RVCompare was created as a convenient way of applying and experimenting with the proposed framework.