A draft honesty policy for credible communication with AI systems by Forethought

Nuno Sempere

This is a rough research note – we’re sharing it for feedback and to spark discussion. We’re less confident in its methods and conclusions.

Context

We think that it would be very good if human institutions could credibly communicate with advanced AI systems. This could enable positive-sum trade between humans and AIs instead of conflict that leaves everyone worse off.[1] We want models to be able to trust companies when they make an honest offer or share informa...
