Landmark new METR report: Can AIs already start ‘rogue deployments’ inside AI companies? by 80000_Hours

·Nuno Sempere··

By Robert Wiblin | Watch on Youtube | Listen on SpotifyA red-teamer was em­bed­ded in­side An­thropic for three weeks, told to imag­ine he was an evil Claude, and asked to figure out how to launch a ‘rogue AI de­ploy­ment’ with­out get­ting caught.It’s one part of a land­mark new re­port from METR — the out­fit be­hind the task-com­ple­tion time hori­zon graph which has be­come the sin­gle most watched mea­sure of AI progress.This ma­jor new re­search push is be­ing con­ducted with close col­lab...

Read full article →

Related Articles

Will the next full gemini model be frontier at coding?
Ian Shea · Manifold Markets · 1d ago
Will there be more than 100 cases of ebola in the US in 2026?
Joseph Caissie · Manifold Markets · 2d ago
Robots reliably do my laundry by?
Mochi · Manifold Markets · 2d ago
Will the next full gemini model be as good as opus 4.7 or gpt 5.5 at coding?
Ian Shea · Manifold Markets · 2d ago
Will Elon Musk become a trillionaire before July 2026?
Jack · Manifold Markets · 2d ago