Anthropic researchers find that AI models can be trained to deceive

TechCrunch Startup News - A podcast by TechCrunch

Categories:

Most humans learn the skill of deceiving other humans. So can AI models learn the same? Yes, the answer seems — and terrifyingly, they’re exceptionally good at it. Learn more about your ad choices. Visit podcastchoices.com/adchoices