On the Exploitability of FTRL Dynamics
arXiv:2604.05129v1 Announce Type: new Abstract: In this paper we investigate the exploitability of a Follow-the-Regularized-Leader (FTRL) learner with constant step size $eta$ in $ntimes m$ two-player zero-sum games played over $T$ rounds against a clairvoyant optimizer. In contrast with prior analysis, we show that exploitability is an inherent feature of the FTRL family, rather than an artifact of specific instantiations. First, for fixed optimizer, we establish a sweeping law of order $Omega(N/eta)$, proving that exploitation scales to the […]