VibeThinker-3B and the Strength of Post-Training

Short note on VibeThinker-3B, a 3B model based on Qwen2.5-Coder-3B whose reported coding and reasoning results point to strong post-training.

Liked Liked