Online Learning with Improving Agents: Multiclass, Budgeted Agents and Bandit Learners

digitado ⋅ 19 de February de 2026

We investigate the recently introduced model of learning with improvements, where agents are allowed to make small changes to their feature values to be warranted a more desirable label. We extensively extend previously published results by providing combinatorial dimensions that characterize online learnability in this model, by analyzing the multiclass setup, learnability in a bandit feedback setup, modeling agents’ cost for making improvements and more.

Like 0

Liked Liked