Get the latest tech news

Modular Manifolds


A geometric framework for co-designing neural net optimizers with manifold constraints.

The abstraction rests upon a key observation made in our paper on the modular norm, that budgeting learning rates—both across layers and when scaling up individual layers—is intimately tied to understanding the Lipschitz sensitivity of the network output with respect to the weights. While hard manifold constraints may not ultimately be the right way to constrain weight matrices, they exemplify the idea of tightly co-designing optimization algorithms with architecural components. The goal of the Modula project is to build a library that automatically compiles steepest descent optimizers along with Lipschitz statements for general architectures.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Thinking Machines

Thinking Machines

Photo of Modular Manifolds

Modular Manifolds

Related news:

News photo

Mira Murati says her startup Thinking Machines will release new product in ‘months’ with ‘significant open source component’

News photo

Murati’s Thinking Machines Raises Cash at $10 Billion Valuation

News photo

Ex-OpenAI CTO Mira Murati unveils Thinking Machines: A startup focused on multimodality, human-AI collaboration