ZAYA1-8B matches DeepSeek-R1 on math with less than 1B active parameters