Security & Riskagent securitythreat detectionopen standardsprompt injection

Agent Threat Rules Defines Open Detection Format for Agent Attacks

||By LDS Team
7.0
Relevance Score
Agent Threat Rules Defines Open Detection Format for Agent Attacks

Agent Threat Rules (ATR) is a new open detection format for AI agent security threats, reported by Help Net Security. ATR rules are YAML documents that follow a versioned schema; each declares the attack pattern it matches, the input it inspects (such as LLM input, tool-call arguments, or SKILL.md content), and test cases that prove it works. The project carries more than 400 rules across categories including prompt injection, agent manipulation, skill compromise, and context exfiltration, and draws on the Sigma (SIEM) and YARA (malware) rule traditions. A reference engine in TypeScript and a Python wrapper, pyATR, evaluate the rules under an MIT license. Per Help Net Security, four organizations already run or have merged ATR, including Microsoft's Agent Governance Toolkit, Cisco AI Defense, MISP at CIRCL, and Gen Digital. The project openly documents weak recall against paraphrased attacks and advises pairing it with sandboxing and human review.

What happened

Help Net Security reports the release of Agent Threat Rules (ATR), an open detection-rule format for AI agent security threats. ATR rules are YAML documents that conform to a versioned schema, and each rule declares the attack pattern it matches, the input field it inspects (such as LLM input, tool-call arguments, or SKILL.md content), and the test cases that demonstrate it works. A reference engine written in TypeScript and a Python wrapper called pyATR evaluate the rules, both under the MIT license. The project carries more than 400 rules across categories including prompt injection, agent manipulation, skill compromise, and context exfiltration, and it borrows from Sigma, the SIEM detection standard, and YARA, the malware pattern language.

Why it matters

AI agents run inside coding assistants, MCP servers, and multi-agent frameworks, and the access that makes them useful also opens paths to prompt injection, tool poisoning, and credential theft. Help Net Security notes that public CVE feeds already carry agent-execution flaws that reach production faster than the tooling built to catch them. A shared, machine-readable rule format, with precedent in Sigma and YARA, can lower the cost of exchanging detections across security tools and speed operational response.

Adoption and standards mapping

Per Help Net Security, four organizations already run or have merged ATR: Microsoft's Agent Governance Toolkit carries a rule pack that auto-syncs weekly, Cisco AI Defense runs a rule pack in production, MISP at CIRCL merged a threat-intel cluster, and Gen Digital (parent of Norton, Avast, and AVG) merged a rule pack. The report notes adopters self-declare by pull request without maintainer pre-approval. The rule set is reported to cover all 10 OWASP Agentic Top 10 categories and 78 of 85 SAFE-MCP techniques, with individual rules referencing CVEs affecting Microsoft Semantic Kernel, Spring AI, LiteLLM, and Claude Code.

Stated limitations

The project publishes version-pinned benchmark numbers and is candid about gaps. Per Help Net Security, recall is high on structured, in-the-wild jailbreak corpora (about 98% on NVIDIA garak's in-the-wild set) but falls sharply on paraphrased or semantically rephrased attacks, with several academic adversarial sets registering low single-digit or zero recall. The team attributes this to the limits of a regex-based matching layer and recommends pairing ATR with credential brokering, sandboxed execution, and human review for high-risk actions.

What to watch

  • Whether more vendors integrate ATR into EDR, SOC, and agent-runtime controls.
  • How the format maps to adjacent standards such as Sigma, STIX, and SAFE-MCP over time.
  • Whether semantic or model-based matching is added to close the paraphrase-recall gap.

Key Points

  • 1ATR is a versioned, YAML-based open standard for AI-agent threat detection, modeled on Sigma and YARA, with 400+ rules, an MIT-licensed reference engine, and a pyATR Python wrapper.
  • 2Per Help Net Security, it already runs in production tooling, including Microsoft's Agent Governance Toolkit, Cisco AI Defense, MISP, and Gen Digital, and maps to all 10 OWASP Agentic Top 10 categories.
  • 3The project is transparent about limits: regex-based rules catch structured attacks but show low recall on paraphrased ones, so it advises layering with sandboxing and human review.

Scoring Rationale

A new open, YAML-based detection standard for AI-agent attacks is a meaningful development for security and MLOps teams, especially given reported production use at Microsoft, Cisco, MISP, and Gen Digital and full coverage of the OWASP Agentic Top 10. It is a practical tooling and standards story rather than a foundational research advance, and the project is candid about real recall gaps, so it lands as notable and broadly useful rather than industry-shaking.

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems