Anthropic Says ‘Evil’ Portrayals of AI Were Responsible For Claude’s Blackmail Attempts
An anonymous reader quotes a report from TechCrunch: Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic. Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avo … ⌘ Read more