Monday June 1, 2026 3:30pm - 3:50pm PDT
We explore how AI systems can automatically mutate and refine their own prompts to bypass defenses more effectively over time, showing how repeated adversarial testing dramatically increases jailbreak success rates. Through this process, it becomes clear why static guardrails and fixed policy layers quickly collapse when faced with recursive, adaptive probing. Finally, we examine what this means for modern agentic AI systems that have access to tools, APIs, and privileged permissions, and why their expanding capabilities demand a fundamentally different approach to security and oversight.




Speakers

Mrigakshi Goel

Finning International

Jailbreak the Jailbreaker: Autonomous AI Red Teaming


Track 5 - Room 1800