A draft honesty policy for credible communication with AI systems by Forethought

Nuno Sempere

This is a rough research note – we're sharing it for feedback and to spark discussion. We're less confident in its methods and conclusions.

Context

We think that it would be very good if human institutions could credibly communicate with advanced AI systems. This could enable positive-sum trade between humans and AIs instead of conflict that leaves everyone worse off.[1] We want models to be able to trust companies when they make an honest offer or share informa...

