High-Dimensional Limit of Stochastic Gradient Flow via Dynamical Mean-Field Theory
arXiv:2602.06320v2 Announce Type: replace Abstract: Modern machine learning models are typically trained via multi-pass stochastic gradient descent (SGD) with small batch sizes, and understanding their dynamics in high dimensions is of great interest. However, an analytical framework for describing the high-dimensional asymptotic behavior of multi-pass SGD with small batch sizes for nonlinear models is currently missing. In this study, we address this gap by analyzing the high-dimensional dynamics of a stochastic differential equation called a emph{stochastic gradient flow} […]