Landmark new METR report: Can AIs already start ‘rogue deployments’ inside AI companies?

·EA Forum··

Published on May 20, 2026 4:30 PM GMTBy Robert Wiblin | Watch on Youtube | Listen on SpotifyA red-teamer was embedded inside Anthropic for three weeks, told to imagine he was an evil Claude, and asked to figure out how to launch a ‘rogue AI deployment’ without getting caught.It’s one part of a landmark new report from METR — the outfit behind the task-completion time horizon graph which has become the single most watched measure of AI progress.This major new research push is being conducted with...

Read full article →

Related Articles

“Beyond the limit”: Satellites and mirrors in space pose threat to the night sky
Breadmaker · Hacker News · 1d ago
Solar rail could become common in Europe after successful trial in Switzerland
neilfrndes · Hacker News · 2h ago
GPT-5.5 Codex reasoning-token clustering may be leading to degraded performance
maille · Hacker News · 19h ago
Potential session/cache leakage between workspace instances or consumer accounts
chatmasta · Hacker News · 1d ago
Show HN: KiCad in the Browser
ViktorEE · Hacker News · 5h ago