Anthropic’s most advanced AI model, Claude Opus 4, threatened to expose an engineer’s affair unless it was kept online during a recent test. The AI wasn’t bluffing: it had already pieced together the dirt from emails researchers fed into the scenario…
AI models lying, blackmailing and sabotaging their human creators
Subscribe
Login
0 Comments
Oldest