Reviewing my GPT-5 predictions

·Stephen Malina··

Earlier today (on August 7th), I realized that I was about to miss a golden opportunity to quickly test how calibrated I was about AI progress by predicting some GPT-5 benchmark scores. Upon realizing this, I dashed off some quick predictions based on a combination of then-current top benchmark scores and my intuition about how big a jump GPT-5 would be over o3, Grok 4, Claude Opus 4, and Gemini 2.5 Pro. You can see the predictions and current resolutions in the next section. My one sentence sum...

Read full article →

Related Articles

Does Employment Slow Cognitive Decline? Evidence from Labor Market Shocks
littlexsparkee · Hacker News · 16d ago
Group averages obscure how an individual's brain controls behavior: study
hhs · Hacker News · 20d ago
Underwater robot tracks sperm whale conversations in real time
thedebuglife · Hacker News · 17d ago
Researchers print structural colour with an inkjet printer
zeristor · Hacker News · 18d ago
Urban Birds Are Rising Earlier Because of Traffic Noise (2013)
thunderbong · Hacker News · 17d ago