Landmark new METR report: Can AIs already start ‘rogue deployments’ inside AI companies? by 80000_Hours

·Nuno Sempere··

By Robert Wiblin | Watch on Youtube | Listen on SpotifyA red-teamer was em­bed­ded in­side An­thropic for three weeks, told to imag­ine he was an evil Claude, and asked to figure out how to launch a ‘rogue AI de­ploy­ment’ with­out get­ting caught.It’s one part of a land­mark new re­port from METR — the out­fit be­hind the task-com­ple­tion time hori­zon graph which has be­come the sin­gle most watched mea­sure of AI progress.This ma­jor new re­search push is be­ing con­ducted with close col­lab...

Read full article →

Related Articles

will an AI get a nobel prize before 2040?
Cimorene Blume · Manifold Markets · 20h ago
How to Solve AI Biosecurity by Sophie Kim
Sophie Kim · Nuno Sempere · 1d ago
The Learning Trap: What Simulated Clueless Agents Reveal About the Unawareness Argument by dan.pandori 🔸
dan.pandori 🔸 · Nuno Sempere · 1d ago
To minimize the overall amount of suffering one causes, is it better to eat an organic Vegan diet, or eat a conventional Vegan diet and donate the money saved? Measured in neuron deaths of conscious animals and using AI estimates. Comparisons to non-Vegan diets included. by PreciousPig
PreciousPig · Nuno Sempere · 2d ago
Animal disenhancement by weganskie_miaso
weganskie_miaso · Nuno Sempere · 3d ago