VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO
Abstract page for arXiv paper 2606.16140: VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models
Read full article →Abstract page for arXiv paper 2606.16140: VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models
Read full article →