Get the latest tech news

MHC: Manifold-Constrained Hyper-Connections


Recently, studies exemplified by Hyper-Connections (HC) have extended the ubiquitous residual connection paradigm established over the past decade by expanding the residual stream width and diversifying connectivity patterns. While yielding substantial performance gains, this diversification fundamentally compromises the identity mapping property intrinsic to the residual connection, which causes severe training instability and restricted scalability, and additionally incurs notable memory access overhead. To address these challenges, we propose Manifold-Constrained Hyper-Connections (mHC), a general framework that projects the residual connection space of HC onto a specific manifold to restore the identity mapping property, while incorporating rigorous infrastructure optimization to ensure efficiency. Empirical experiments demonstrate that mHC is effective for training at scale, offering tangible performance improvements and superior scalability. We anticipate that mHC, as a flexible and practical extension of HC, will contribute to a deeper understanding of topological architecture design and suggest promising directions for the evolution of foundational models.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of connections

connections

Photo of MHC

MHC

Related news:

News photo

Why You’re Better Than a Computer at Solving Connections

News photo

UK Grid Overwhelmed by Data-Center Requests for Connections

News photo

Celebrating the partners driving Disrupt’s big ideas, connections, and community