Stochastic approximation in non-markovian environments revisited

arXiv:2603.21091v1 Announce Type: new
Abstract: Based on some recent work of the author on stochastic approximation in non-markovian environments, the situation when the driving random process is non-ergodic in addition to being non-markovian is considered. Using this, we propose an analytic framework for understanding transformer based learning, specifically, the `attention’ mechanism, and continual learning, both of which depend on the entire past in principle.

Liked Liked