A Dream of Spring for Open-Weight LLMs: 10 Architectures from Jan-Feb 2026

·Sebastian Raschka··

If you have struggled a bit to keep up with open-weight model releases this month, this article should catch you up on the main themes.In this article, I will walk you through the ten main releases in chronological order, with a focus on the architecture similarities and differences:Arcee AI’s Trinity Large (Jan 27, 2026)Moonshot AI’s Kimi K2.5 (Jan 27, 2026)StepFun Step 3.5 Flash (Feb 1, 2026)Qwen3-Coder-Next (Feb 3, 2026)z.AI’s GLM-5 (Feb 12, 2026)MiniMax M2.5 (Feb 12, 2026)Nanbeige 4.1 3B (Fe...

Read full article →

Related Articles

Accelerating Gemma 4: faster inference with multi-token prediction drafters
amrrs · Hacker News · 3d ago
ProgramBench: Can language models rebuild programs from scratch?
jonbaer · Hacker News · 1d ago
ZAYA1-8B matches DeepSeek-R1 on math with less than 1B active parameters
steveharing1 · Hacker News · 1d ago
OpenAI’s o1 correctly diagnosed 67% of ER patients vs. 50-55% by triage doctors
donsupreme · Hacker News · 6d ago
A couple million lines of Haskell: Production engineering at Mercury
unignorant · Hacker News · 6d ago