Characterizing Dynamical Stability of Stochastic Gradient Descent in Overparameterized Learning

Open Access
Authors
Publication date 2025
Journal Journal of Machine Learning Research
Volume | Issue number 26
Number of pages 46
Organisations
  • Faculty of Science (FNWI) - Korteweg-de Vries Institute for Mathematics (KdVI)
Abstract
For overparameterized optimization tasks, such as those found in modern machine learning, global minima are generally not unique. In order to understand generalization in these settings, it is vital to study to which minimum an optimization algorithm converges. The possibility of having minima that are unstable under the dynamics imposed by the optimization algorithm limits the potential minima that the algorithm can find. In this paper, we characterize the global minima that are dynamically stable/unstable for both deterministic and stochastic gradient descent (SGD). In particular, we introduce a characteristic Lyapunov exponent that depends on the local dynamics around a global minimum and rigorously prove that the sign of this Lyapunov exponent determines whether SGD can accumulate at the respective global minimum.
Document type Article
Language English
Published at https://jmlr.org/papers/v26/24-1547.html
Other links https://www.scopus.com/pages/publications/105018668615
Downloads
Permalink to this page
Back