Reviewing my GPT-5 predictions

·Stephen Malina··

Earlier today (on August 7th), I realized that I was about to miss a golden opportunity to quickly test how calibrated I was about AI progress by predicting some GPT-5 benchmark scores. Upon realizing this, I dashed off some quick predictions based on a combination of then-current top benchmark scores and my intuition about how big a jump GPT-5 would be over o3, Grok 4, Claude Opus 4, and Gemini 2.5 Pro. You can see the predictions and current resolutions in the next section. My one sentence sum...

Read full article →

Related Articles

Does Employment Slow Cognitive Decline? Evidence from Labor Market Shocks
littlexsparkee · Hacker News · 4d ago
Underwater robot tracks sperm whale conversations in real time
thedebuglife · Hacker News · 5d ago
Linking spatial biology and clinical histology via Haiku
Yan Cui, Jacob S. Leiby, Wenhui Lei, Dokyoon Kim, Yanxiang Deng, Aaron T. Mayer, Zhenqin Wu, Alexandro E. Trevino, Zhi Huang · ArXiv cs.LG · 3d ago
CGM-JEPA: Learning Consistent Continuous Glucose Monitor Representations via Predictive Self-Supervised Pretraining
Hada Melino Muhammad, Zechen Li, Flora Salim, Ahmed A. Metwally · ArXiv cs.LG · 3d ago
CellxPert: Inference-Time MCMC Steering of a Multi-Omics Single-Cell Foundation Model for In-Silico Perturbation
Andac Demir, Erik W. Anderson, Jeremy L. Jenkins, Srayanta Mukherjee · ArXiv q-bio · 3d ago