KV Sharing, MHC, and Compressed Attention

gmays·Hacker News·Community·May 19, 2026

From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs

Read full article →

Related Articles

“Beyond the limit”: Satellites and mirrors in space pose threat to the night sky

Breadmaker · Hacker News · 1d ago

GPT-5.5 Codex reasoning-token clustering may be leading to degraded performance

maille · Hacker News · 19h ago

Potential session/cache leakage between workspace instances or consumer accounts

chatmasta · Hacker News · 1d ago

EV Batteries Are Defying Expectations After Miles

apparent · Hacker News · 10h ago

Show HN: KiCad in the Browser

ViktorEE · Hacker News · 5h ago