Get the latest tech news
How does gradient descent work?
This is the companion website for the paper Understanding Optimization in Deep Learning with Central Flows, published at ICLR 2025. The simplest optimization algorithm is deterministic gradient descent: Perhaps surprisingly, traditional analyses of gradient descent cannot capture the typical dynamics of gradient descent in deep learning.
None
Or read this on Hacker News
