Claude’s New Ability to End Harmful Conversations for Safety and AI Welfare
Anthropic has introduced a feature that lets its Claude models end a conversation when harmful interactions persist.