Landmark new METR report: Can AIs already start ‘rogue deployments’ inside AI companies? by 80000_Hours
By Robert Wiblin

A red-teamer was embedded inside Anthropic for three weeks, told to imagine he was an evil Claude, and asked to figure out how to launch a ‘rogue AI deployment’ without getting caught.

It’s one part of a landmark new report from METR, the outfit behind the task-completion time horizon graph, which has become the single most watched measure of AI progress.

This major new research push is being conducted with close collab...