March 2026

Combinatorial Rising Bandits

digitado ⋅ March 4, 2026

arXiv:2412.00798v4 Announce Type: replace-cross Abstract: Combinatorial online learning is a fundamental task for selecting the optimal action (or super arm) as a combination of base arms in sequential interactions with systems providing stochastic rewards. It is applicable to diverse domains such as robotics, social advertising, network routing, and recommendation systems. In many real-world scenarios, we often encounter rising rewards, where playing a base arm not only provides an instantaneous reward but also contributes to the enhancement of future […]

technocracy

The Bounds of Algorithmic Collusion; $Q$-learning, Gradient Learning, and the Folk Theorem

digitado ⋅ March 4, 2026

arXiv:2411.12725v2 Announce Type: replace-cross Abstract: We explore the behaviour emerging from learning agents repeatedly interacting strategically for a wide range of learning dynamics, including $Q$-learning, projected gradient, replicator and log-barrier dynamics. Going beyond the better understood classes of potential games and zero-sum games, we consider the setting of a general repeated game with finite recall under different forms of monitoring. We obtain a Folk Theorem-style result and characterise the set of payoff vectors that can be obtained by […]

Statistical guarantees for denoising reflected diffusion models

digitado ⋅ March 4, 2026

arXiv:2411.01563v3 Announce Type: replace-cross Abstract: In recent years, denoising diffusion models have become a crucial area of research due to their abundance in the rapidly expanding field of generative AI. While recent statistical advances have delivered explanations for the generation ability of idealised denoising diffusion models for high-dimensional target data, implementations introduce thresholding procedures for the generating process to overcome issues arising from the unbounded state space of such models. This mismatch between theoretical design and implementation of […]

Absolute abstraction: a renormalisation group approach

digitado ⋅ March 4, 2026

arXiv:2407.01656v5 Announce Type: replace-cross Abstract: Abstraction is the process of extracting the essential features from raw data while ignoring irrelevant details. It is well known that abstraction emerges with depth in neural networks, where deep layers capture abstract characteristics of data by combining lower level features encoded in shallow layers (e.g. edges). Yet we argue that depth alone is not enough to develop truly abstract representations. We advocate that the level of abstraction crucially depends on how broad […]

Statistical Inference with Stochastic Gradient Methods under $\phi$-mixing Data

digitado ⋅ March 4, 2026

arXiv:2302.12717v3 Announce Type: replace-cross Abstract: Stochastic gradient descent (SGD) is a scalable and memory-efficient optimization algorithm for large datasets and stream data, which has drawn a great deal of attention and popularity. The applications of SGD-based estimators to statistical inference such as interval estimation have also achieved great success. However, most of the related works are based on i.i.d. observations or Markov chains. When the observations come from a mixing time series, how to conduct valid statistical inference […]

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

digitado ⋅ March 4, 2026

arXiv:2210.10278v2 Announce Type: replace-cross Abstract: We study reserve price optimization in multi-phase second price auctions, where the seller’s prior actions affect the bidders’ later valuations through a Markov Decision Process (MDP). Compared to the bandit setting in existing works, the setting in ours involves three challenges. First, from the seller’s perspective, we need to efficiently explore the environment in the presence of potentially untruthful bidders who aim to manipulate the seller’s policy. Second, we want to minimize the […]

Grokking as a Phase Transition between Competing Basins: a Singular Learning Theory Approach

digitado ⋅ March 4, 2026

arXiv:2603.01192v2 Announce Type: replace Abstract: Grokking, the abrupt transition from memorization to generalisation after extended training, suggests the presence of competing solution basins with distinct statistical properties. We study this phenomenon through the lens of Singular Learning Theory (SLT), a Bayesian framework that characterizes the geometry of the loss landscape via the local learning coefficient (LLC), a measure of the local degeneracy of the loss surface. SLT links lower-LLC basins to higher posterior mass concentration and lower expected […]

A Researcher’s Guide to Empirical Risk Minimization

digitado ⋅ March 4, 2026

arXiv:2602.21501v2 Announce Type: replace Abstract: This guide provides a reference for high-probability regret bounds in empirical risk minimization (ERM). The presentation is modular: we begin with intuition and general proof strategies, then state broadly applicable guarantees under high-level conditions and provide tools for verifying them for specific losses and function classes. We emphasize that many ERM rate derivations can be organized around a three-step recipe — a basic inequality, a uniform local concentration bound, and a fixed-point argument […]

FAST: Topology-Aware Frequency-Domain Distribution Matching for Coreset Selection

digitado ⋅ March 4, 2026

arXiv:2511.19476v3 Announce Type: replace Abstract: Coreset selection compresses large datasets into compact, representative subsets, reducing the energy and computational burden of training deep neural networks. Existing methods are either: (i) DNN-based, which are tied to model-specific parameters and introduce architectural bias; or (ii) DNN-free, which rely on heuristics lacking theoretical guarantees. Neither approach explicitly constrains distributional equivalence, largely because continuous distribution matching is considered inapplicable to discrete sampling. Moreover, prevalent metrics (e.g., MSE, KL, CE, MMD) cannot accurately […]

Fast Estimation of Wasserstein Distances via Regression on Sliced Wasserstein Distances

digitado ⋅ March 4, 2026

arXiv:2509.20508v2 Announce Type: replace Abstract: We address the problem of efficiently computing Wasserstein distances for multiple pairs of distributions drawn from a meta-distribution. To this end, we propose a fast estimation method based on regressing Wasserstein distance on sliced Wasserstein (SW) distances. Specifically, we leverage both standard SW distances, which provide lower bounds, and lifted SW distances, which provide upper bounds, as predictors of the true Wasserstein distance. To ensure parsimony, we introduce two linear models: an unconstrained […]
