The Quadratic Sandwich

Hacker News

If you have ever tried to minimize a function with gradient descent, you have probably noticed that some functions are a joy to optimize and others are a nightmare. The difference often boils down to two properties: strong convexity and L-smoothness. Together they define a “sandwich” of quadratic bounds around your function: strong convexity supplies the lower slice, L-smoothness the upper one, and the gap between them tells you how well-behaved the function is. If the sandwich is tight, life is good. If one slice of bread is missing, things get ugly fast.
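
To make the sandwich concrete, here is a minimal sketch that checks both bounds numerically. It assumes the standard textbook definitions, f(y) >= f(x) + <grad f(x), y - x> + (mu/2)||y - x||^2 for mu-strong convexity and the same expression with L/2 as an upper bound for L-smoothness. The toy function, the constants MU and L, and the helper names sandwich and check are all illustrative choices, not taken from the article.

```python
import random

# Assumed curvature constants for a toy quadratic (hypothetical values).
MU, L = 1.0, 10.0


def f(x):
    # f(x) = 0.5 * (MU * x0^2 + L * x1^2): MU-strongly convex and L-smooth.
    return 0.5 * (MU * x[0] ** 2 + L * x[1] ** 2)


def grad(x):
    # Gradient of the toy quadratic above.
    return (MU * x[0], L * x[1])


def sandwich(x, y):
    """Return (lower, f(y), upper) for the quadratic bounds built at x."""
    d = (y[0] - x[0], y[1] - x[1])
    g = grad(x)
    linear = f(x) + g[0] * d[0] + g[1] * d[1]  # first-order model at x
    sq = d[0] ** 2 + d[1] ** 2                 # squared distance ||y - x||^2
    return linear + 0.5 * MU * sq, f(y), linear + 0.5 * L * sq


def check(trials=1000, seed=0, tol=1e-9):
    """Test lower <= f(y) <= upper on random pairs of points."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = (rng.uniform(-5, 5), rng.uniform(-5, 5))
        y = (rng.uniform(-5, 5), rng.uniform(-5, 5))
        lower, value, upper = sandwich(x, y)
        if not (lower - tol <= value <= upper + tol):
            return False
    return True


if __name__ == "__main__":
    print(check())
```

The ratio L / MU (10 in this sketch) is the condition number: the closer it is to 1, the tighter the sandwich, and the closer the two quadratic slices sit around the function.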
