Researchers Release Agentic Safety Framework With Dataset

A research team from NVIDIA and Lakera AI has released a safety and security framework for agentic systems, including an operational taxonomy, dynamic evaluation method, and a case study of NVIDIA’s AI-Q Research Assistant. The authors also published a dataset of more than 10,000 attack and defense traces and over 6,000 risk measurements to support continuous probing and safer enterprise deployments.
Key Points
- 1Release of framework, taxonomy, dynamic evaluation, and dataset with over 10,000 attacker/defender traces.
- 2Highlights emergent system-level risks from tool interactions, memory, retrieval, and agentic plan variability.
- 3Encourages continuous probing, layered defenses, observability, and operational traces for safer agentic deployments.
Scoring Rationale
Comprehensive, reproducible framework and dataset empower practitioners; however, early-stage adoption limits immediate industry-wide mandate and measurable outcomes.
Sources
Public references used for this report.
Practice with real FinTech & Trading data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all FinTech & Trading problems
