Continuously hardening ChatGPT Atlas against prompt injection
OpenAI is strengthening ChatGPT Atlas against prompt injection assaults utilizing automated purple teaming skilled with reinforcement studying. This proactive discover-and-patch loop helps determine novel exploits early and harden the browser agent’s defenses as AI turns into extra agentic.
