关于通过Wasserstein分布的概括和正规化优化

论文标题

关于通过Wasserstein分布的概括和正规化优化

On Generalization and Regularization via Wasserstein Distributionally Robust Optimization

论文作者

Wu, Qinyu, Li, Jonathan Yu-Meng, Mao, Tiantian

论文摘要

Wasserstein分布在强大的优化方面（DRO）在运营研究和机器学习方面已成为一种有力的方法，用于实现具有良好样本外部性能的解决方案。关于其成功的两个引人注目的解释是从Wasserstein Dro中得出的概括范围及其等于机器学习中常用的正则化方案。但是，现有关于概括范围和正则化等效性的结果在很大程度上仅限于Wasserstein Ball具有特定类型的设置，并且决策标准采用某些形式的预期功能。在本文中，我们表明可以在更广泛的环境中获得概括范围和正则化等效性，在此设置中，瓦斯坦斯坦球是一般类型的，决策标准可容纳任何形式，包括一般风险措施。这不仅解决了重要的机器学习和操作管理应用程序，而且还扩展到Wasserstein Dro的一般决策理论框架。我们的结果很强，因为概括范围不受维数的诅咒，而正规化的等效性也是如此。作为副产品，我们表明Wasserstein Dro与最近在仿射决策规则下的{\ IT}决策标准的最近的Max-Sined Wasserstein Dro相吻合 - 通过我们的一般正则化结果可以有效地作为凸面程序可有效地解决。这些一般保证为扩大瓦斯史坦·dro（Wasserstein Dro）的应用而跨越了数据驱动的决策问题的范围。

Wasserstein distributionally robust optimization (DRO) has gained prominence in operations research and machine learning as a powerful method for achieving solutions with favorable out-of-sample performance. Two compelling explanations for its success are the generalization bounds derived from Wasserstein DRO and its equivalence to regularization schemes commonly used in machine learning. However, existing results on generalization bounds and regularization equivalence are largely limited to settings where the Wasserstein ball is of a specific type, and the decision criterion takes certain forms of expected functions. In this paper, we show that generalization bounds and regularization equivalence can be obtained in a significantly broader setting, where the Wasserstein ball is of a general type and the decision criterion accommodates any form, including general risk measures. This not only addresses important machine learning and operations management applications but also expands to general decision-theoretical frameworks previously unaddressed by Wasserstein DRO. Our results are strong in that the generalization bounds do not suffer from the curse of dimensionality and the equivalency to regularization is exact. As a by-product, we show that Wasserstein DRO coincides with the recent max-sliced Wasserstein DRO for {\it any} decision criterion under affine decision rules -- resulting in both being efficiently solvable as convex programs via our general regularization results. These general assurances provide a strong foundation for expanding the application of Wasserstein DRO across diverse domains of data-driven decision problems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题