A new study by Anthropic shows that LLMs can have hidden backdoors that persist through standard safety training.