A Dream of Spring for Open-Weight LLMs: 10 Architectures from Jan-Feb 2026

·Sebastian Raschka··

If you have struggled a bit to keep up with open-weight model releases this month, this article should catch you up on the main themes.In this article, I will walk you through the ten main releases in chronological order, with a focus on the architecture similarities and differences:Arcee AI’s Trinity Large (Jan 27, 2026)Moonshot AI’s Kimi K2.5 (Jan 27, 2026)StepFun Step 3.5 Flash (Feb 1, 2026)Qwen3-Coder-Next (Feb 3, 2026)z.AI’s GLM-5 (Feb 12, 2026)MiniMax M2.5 (Feb 12, 2026)Nanbeige 4.1 3B (Fe...

Read full article →

Related Articles

OpenAI’s o1 correctly diagnosed 67% of ER patients vs. 50-55% by triage doctors
donsupreme · Hacker News · 2mo ago
Accelerating Gemma 4: faster inference with multi-token prediction drafters
amrrs · Hacker News · 2mo ago
A couple million lines of Haskell: Production engineering at Mercury
unignorant · Hacker News · 2mo ago
Show HN: I trained a language model that thinks the capital of Japan is Paris
farisallafi · Hacker News · 9h ago
Using “underdrawings” for accurate text and numbers
samcollins · Hacker News · 2mo ago