Trinity Large: An open 400B sparse MoE model
A deep dive into Trinity Large, covering architecture, sparsity, training at scale, and why we shipped Preview, Base, and TrueBase checkpoints.