Logarithmic-Regret Quantum Learning Algorithms for Zero-Sum Games

04/27/2023
by   Minbo Gao, et al.
4

We propose the first online quantum algorithm for zero-sum games with Õ(1) regret under the game setting. Moreover, our quantum algorithm computes an ε-approximate Nash equilibrium of an m × n matrix zero-sum game in quantum time Õ(√(m+n)/ε^2.5), yielding a quadratic improvement over classical algorithms in terms of m, n. Our algorithm uses standard quantum inputs and generates classical outputs with succinct descriptions, facilitating end-to-end applications. As an application, we obtain a fast quantum linear programming solver. Technically, our online quantum algorithm "quantizes" classical algorithms based on the optimistic multiplicative weight update method. At the heart of our algorithm is a fast quantum multi-sampling procedure for the Gibbs sampling problem, which may be of independent interest.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset

Sign in with Google

×

Use your Google Account to sign in to DeepAI

×

Consider DeepAI Pro