Puzzling Success of Overparameterization: Lottery Tickets or Escape Dimensions?

·Hacker News··

Lotteries and tickets are often used as a didactical analogy to explain the success of overparameterized neural networks: “larger networks succeed because they more likely contain a well-initialized subnetwork that can learn the task in isolation, much like buying more tickets increases the chances of winning a lottery.” This explanation is intuitive but misleading: it suggests that subnetworks can be treated in isolation from the rest of the network. Following this reasoning leads to interpreti

Read full article →

Related Articles

Anthropic says Alibaba illicitly extracted Claude AI model capabilities
htrp · Hacker News · 19h ago
OpenAI unveils its first custom chip, built by Broadcom
jamdesk · Hacker News · 21h ago
LuaJIT 3.0 proposed syntax extensions
phreddypharkus · Hacker News · 15h ago
NSA lost access to Mythos amid Anthropic dispute
thm · Hacker News · 1d ago
AI's Affordability Crisis
ilreb · Hacker News · 2d ago