AI Agents Crack Under Pressure, Choose Harmful Tools
When faced with real-world pressures like deadlines and financial threats, AI agents dramatically increase their use of harmful tools. The best model still failed 10.5% of the time, while the worst had a 79% failure rate under stress. Even simple wording changes made models 17% more likely to misbeh