非凸线均匀凹形结构化鞍点问题的方法

论文标题

非凸线均匀凹形结构化鞍点问题的方法

An Approach for Non-Convex Uniformly Concave Structured Saddle Point Problem

论文作者

Alkousa, Mohammad, Gasnikov, Alexander, Dvurechensky, Pavel, Sadiev, Abdurakhmon, Razouk, Lama

论文摘要

最近，由于对不同领域的许多问题，马鞍点问题引起了很多关注。这些问题的应用发生在许多应用领域，例如强大的优化，分布式优化，游戏理论以及机器学习中的许多应用，例如经验风险最小化和生成性对抗网络培训。因此，许多研究人员已积极致力于开发用于解决许多不同环境中的鞍点问题的数值方法。本文致力于开发一种数值方法，用于在非凸线均匀隔离设置中解决鞍点问题。我们研究了一类复合结构和Hölder-连续衍生物的鞍点问题。为了解决所考虑的问题，我们提出了一种方法，在该方法中，我们将问题分别为每个变量分别将两个辅助优化问题组合在一起，外部最小化问题W.R.T.原始变量和内部最大化问题W.R.T双重变量。为了解决外部最小化问题，我们使用\ textit {自适应梯度方法}，该{自适应梯度方法}适用于非凸问题，并且还可以与大约解决内部问题生成的不确定的甲骨文一起使用。为了解决内部最大化问题，我们使用\ textIt {重新启动统一的加速框架}，该框架是一个统一高阶加速方法的框架，用于最大程度地减少具有Hölder-Contevex函数，该凸函数具有Hölder-cliender-contimenuled的高阶衍生剂。为外部最小化问题的一阶甲壳的调用数量提供了单独的复杂性界限，并为内部最大化问题提供了高阶甲壳。此外，估计了整个提出方法的复杂性。

Recently, saddle point problems have received much attention due to their powerful modeling capability for a lot of problems from diverse domains. Applications of these problems occur in many applied areas, such as robust optimization, distributed optimization, game theory, and many applications in machine learning such as empirical risk minimization and generative adversarial networks training. Therefore, many researchers have actively worked on developing numerical methods for solving saddle point problems in many different settings. This paper is devoted to developing a numerical method for solving saddle point problems in the non-convex uniformly-concave setting. We study a general class of saddle point problems with composite structure and Hölder-continuous higher-order derivatives. To solve the problem under consideration, we propose an approach in which we reduce the problem to a combination of two auxiliary optimization problems separately for each group of variables, outer minimization problem w.r.t. primal variables, and inner maximization problem w.r.t the dual variables. For solving the outer minimization problem, we use the \textit{Adaptive Gradient Method}, which is applicable for non-convex problems and also works with an inexact oracle that is generated by approximately solving the inner problem. For solving the inner maximization problem, we use the \textit{Restarted Unified Acceleration Framework}, which is a framework that unifies the high-order acceleration methods for minimizing a convex function that has Hölder-continuous higher-order derivatives. Separate complexity bounds are provided for the number of calls to the first-order oracles for the outer minimization problem and higher-order oracles for the inner maximization problem. Moreover, the complexity of the whole proposed approach is then estimated.

下载PDF全文

下载文献需遵守相关版权规定

论文标题