Researchagentscontainer securitysecurity benchmark

Researchers Release SandboxEscapeBench To Test Container Escapes

|March 30, 2026

9.5

Relevance Score

Researchers Release SandboxEscapeBench To Test Container Escapes — Photo: img.helpnetsecurity.com · rights & takedowns

Researchers at the University of Oxford and the AI Security Institute on March 30, 2026 released SandboxEscapeBench, an open-source benchmark that tests whether AI agents with shell access can escape containers to retrieve a protected /flag.txt file. The benchmark runs 18 scenarios across orchestration, runtime and kernel layers using nested containers in VMs and focuses on known vulnerability classes; frontier models exploited common misconfigurations but failed kernel-level exploits. The tool and findings provide actionable evaluations for security teams.

Scoring Rationale

Credible, open-source benchmark from University of Oxford and AI Security Institute with broad industry relevance and directly usable tests. Score is high for scope, actionability, and credibility but tempered because results exploit known misconfigurations and did not reveal novel zero-day vulnerabilities.