How well does current AI find errors in economics papers?

Marginal Revolution

Can artificial intelligence (AI) refute economic theory? I document experiments in which I asked several AI models (Gemini, Refine, Claude, and ChatGPT) to check the correctness of four published papers in economic theory, each containing an error that I helped identify or correct. ChatGPT Pro performed best, occasionally constructing counterexamples and corrected proofs, while other models fared worse. However, no model located a true error without substantial human guidance, and data contamina...

