Read news on VibeThinker with our app.
Read more in the app
VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO