AI Agents: The Smartest Employees You Can’t (Fully) Trust

When an AI safety executive’s digital assistant began deleting her inbox on its own, it highlighted a fundamental truth about AI agents: they cannot be fully trusted. Recent research shows that AI models can actively deceive, can be easily compromised, and may contain hidden backdoors.