Sparse Efficiency vs. Superposition: The Interpretability Tradeoff

LessWrong

Today’s frontier models train in an expensive style: dense forward passes, huge matrix multiplies, and broad weight updates. The human brain (~5 MWh over 28 years) is an existence proof that learning can be vastly more energy efficient, by about 10,000x, than modern AI training runs (https://coefficientgiving.org/research/how-much-computational-power-does-it-take-to-match-the-human-brain/). The human brain does not achieve this by activating everything all at once. Normal cognition is extremely spa...
