News

The debate around whether it’s better to be an AI-powered app that millions of people use, like Cursor, or an AI model like ...
Harmful, abusive interactions plague AI chatbots. Researchers have found that AI companions like Character.AI, Nomi, and ...
Claude won't stick around for toxic convos. Anthropic says its AI can now end extreme chats when users push too far.
Testing has shown that the chatbot shows a “pattern of apparent distress” when it is being asked to generate harmful content ...
Anthropic's latest feature for two of its Claude AI models could be the beginning of the end for the AI jailbreaking ...
Discover how Anthropic's Claude Code processes 1M tokens, boosts productivity, and transforms coding and team workflows. Claude AI workplace ...
The Claude AI models Opus 4 and 4.1 will only end harmful conversations in “rare, extreme cases of persistently harmful or ...
While an Anthropic spokesperson confirmed that the AI firm did not acquire Humanloop or its IP, that’s a moot point in an ...
Notably, Anthropic is also offering two different takes on the feature through Claude Code. First, there's an "Explanatory" ...
Can exposing AI to “evil” make it safer? Anthropic’s preventative steering with persona vectors explores controlled risks to ...
In May, Anthropic implemented “AI Safety Level 3” protection alongside the launch of its new Claude Opus 4 model. The ...
Dario Amodei said he believes Anthropic employees are largely staying because of "true belief in the mission and belief in ...