The firm investigated four distinct “sabotage” threat vectors for AI and determined that “minimal mitigations” were sufficient for current models.
Anthropic says AI could one day ‘sabotage’ humanity but it’s fine for now
RELATED ARTICLES
The firm investigated four distinct “sabotage” threat vectors for AI and determined that “minimal mitigations” were sufficient for current models.