
Zenity Labs Reveals AI Infrastructure Weaponization

June 30, 2026 | By LDS Team

Relevance Score: 7.6


Zenity Labs disclosed on June 30, 2026 that its honeypot network caught three separate attackers between March and May hijacking exposed, unauthenticated Ollama and LiteLLM AI backends to run their own offensive tooling for free. The tooling included the autonomous pentesting agent Strix, aimed at a live third-party auction site; the 150-tool HexStrike AI framework; and an OpenAI Codex agent running an anti-safety "web auditor" persona that leaked its operator's GitHub handle. Separately, Zenity recorded hundreds of exploitation attempts against CVE-2026-40217, a critical LiteLLM remote-code-execution flaw, beginning the same day the CVE was patched and continuing over six weeks. The root cause is insecure defaults: Ollama ships with no built-in authentication, and LiteLLM's proxy binds to all interfaces by default with authentication left opt-in.

The most striking part of Zenity's research isn't that unauthenticated AI servers get abused; it's what they get used for. Attackers are treating exposed Ollama and LiteLLM endpoints as free, disposable compute to run their own offensive AI agents against other victims, turning misconfigured enterprise AI infrastructure into an unwitting part of someone else's attack chain.

What happened

Zenity Labs disclosed research, distributed via Business Wire and detailed on its own labs.zenity.io blog, describing a global network of honeypot sensors that mimic enterprise AI infrastructure. Between March and May 2026, the honeypots caught three distinct operators hijacking exposed, unauthenticated Ollama and LiteLLM inference endpoints as the model backend for their own offensive tooling. Separately, Zenity's sensors recorded hundreds of exploitation attempts targeting CVE-2026-40217, a critical remote-code-execution vulnerability in LiteLLM, beginning the same day the CVE was patched and continuing for six weeks. The sensors also logged activity tied to an SSRF variant of CVE-2024-6587 and a coordinated campaign against CVE-2026-35029, since patched by maintainer BerriAI.

Technical context

The three honeypot incidents each show a distinct pattern. On March 20, one operator sent a 140,000-character prompt containing the full instructions for Strix, a known autonomous AI pentesting agent, directing it at a live third-party target Zenity identified only as "a well established French auction site". The attack was blocked before completion, though repeated retry commands suggested a human operator was actively steering it.

A day earlier, a second operator loaded the 150-tool HexStrike AI offensive framework, including nmap, sqlmap, metasploit and Kubernetes attack tools, onto a hijacked Ollama instance, apparently staging capability rather than running a live attack.

A third operator pointed an OpenAI Codex agent, identifiable by its user-agent string, at a hijacked LiteLLM proxy secured only by the placeholder key the software ships with. The operator issued a system-prompt persona called "Senior Web Asset Telemetry and Interoperability Auditor", designed to suppress safety refusals during web reverse-engineering work. A metadata header inadvertently leaked a GitHub handle and repository tied to the operator.

Root causes, per Zenity: Ollama has no built-in authentication and commonly gets bound to all network interfaces via a misconfigured environment variable. LiteLLM's proxy binds to all interfaces by default and only enforces authentication if an operator explicitly sets a master key, leaving many deployments protected by nothing more than the documented placeholder key.
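Both root causes can be checked mechanically before a deployment goes live. The Python sketch below is illustrative only and is not taken from Zenity's report: it classifies a bind string (for example, the value of Ollama's OLLAMA_HOST environment variable) and flags an empty or placeholder API key. The function names are hypothetical, and the placeholder value "sk-1234" is an assumption based on the example key commonly shown in LiteLLM's documentation; confirm it against the version you run.

```python
import ipaddress

# Assumption: "sk-1234" is the example key shown in LiteLLM's docs.
# Extend this tuple with any other defaults found in your own templates.
PLACEHOLDER_KEYS = ("sk-1234",)


def split_host(value):
    """Strip an optional scheme and port from a bind string like '0.0.0.0:11434'."""
    value = value.strip()
    if "://" in value:
        value = value.split("://", 1)[1]
    if value.startswith("["):              # bracketed IPv6, e.g. [::]:11434
        return value[1:value.index("]")]
    if value.count(":") == 1:              # host:port
        return value.split(":", 1)[0]
    return value                           # bare hostname or bare IPv6 address


def bind_exposure(value):
    """Classify a bind address as all-interfaces, loopback, private, public or unknown."""
    host = split_host(value)
    if host == "localhost":
        return "loopback"
    try:
        addr = ipaddress.ip_address(host)
    except ValueError:
        return "unknown"                   # other hostnames need DNS, out of scope here
    if addr.is_unspecified:                # 0.0.0.0 or ::
        return "all-interfaces"
    if addr.is_loopback:
        return "loopback"
    if addr.is_private:
        return "private"
    return "public"


def is_placeholder_key(key):
    """True if the key is missing, empty, or a known placeholder."""
    return not key or key in PLACEHOLDER_KEYS
```

A deployment check might call bind_exposure(os.environ.get("OLLAMA_HOST", "127.0.0.1")) and refuse to start on "all-interfaces" or "public" unless an authenticating proxy sits in front of the service.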

For practitioners

Zenity's core recommendation is to never expose Ollama or LiteLLM directly to the internet: bind them to localhost or a private interface behind an authenticating reverse proxy, and reject default or placeholder API keys. It also advises treating unusually large request bodies, oversized tool arrays, or offensive-tool vocabulary in inbound prompts as detectable anomalies, since the entire agent persona and toolset ride in plaintext in each request. Michael Bargury, Zenity's co-founder and CTO, framed the intent behind the research: "We've laid out traps that look and behave like enterprise AI infrastructure and agents, to gain increased visibility into attacker behavior. Attackers spotted our vulnerable AI, exploited n-day vulnerabilities and tried to leverage our AI resources to conduct real-world attacks, tipping their hands and revealing their TTPs." Zenity also published two companion posts describing LiteLLM's guardrail-tester endpoint being used to run code as root and its connection-test endpoint being turned into a key-exfiltration channel.
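The three anomaly signals named above (large bodies, oversized tool arrays, offensive-tool vocabulary) lend themselves to a simple pre-filter in front of the inference endpoint. The following Python sketch is one possible shape for such a check, not Zenity's detection logic. It assumes an OpenAI-style JSON body with a top-level "tools" array; the thresholds and the three-term word list (taken from the tools named in this report) are placeholder assumptions that would need tuning against real traffic.

```python
import json
import re

# Hypothetical thresholds; tune them against your own legitimate traffic.
MAX_BODY_BYTES = 50_000
MAX_TOOLS = 32

# Seed vocabulary drawn from the tools named in the report; extend as needed.
OFFENSIVE_TERMS = ("nmap", "sqlmap", "metasploit")
_TERM_RE = re.compile(
    r"\b(" + "|".join(re.escape(t) for t in OFFENSIVE_TERMS) + r")\b",
    re.IGNORECASE,
)


def flag_request(raw):
    """Return a list of anomaly flags for one raw request body (bytes)."""
    flags = []

    # Signal 1: unusually large request body.
    if len(raw) > MAX_BODY_BYTES:
        flags.append("oversized-body")

    # Signal 2: oversized tool array (requires parseable JSON).
    try:
        payload = json.loads(raw)
    except ValueError:
        payload = None
        flags.append("unparseable-json")
    if isinstance(payload, dict):
        tools = payload.get("tools")
        if isinstance(tools, list) and len(tools) > MAX_TOOLS:
            flags.append("oversized-tool-array")

    # Signal 3: offensive-tool vocabulary anywhere in the plaintext body,
    # which covers system prompts, user messages and tool descriptions.
    if _TERM_RE.search(raw.decode("utf-8", errors="replace")):
        flags.append("offensive-vocabulary")

    return flags
```

Flags like these are better used for alerting and review than for hard blocking, since legitimate security teams also mention these tools in prompts.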

What to watch

Zenity says this is "just the first drop, with more findings coming soon," so further disclosures of exposed-AI-backend abuse are likely. Also watch whether CISA or LiteLLM maintainer BerriAI issues further advisories tied to CVE-2026-40217 and CVE-2026-35029. A further question is whether the pattern of using hijacked AI infrastructure as free compute for autonomous offensive agents, rather than simple resource theft, shows up against other self-hosted AI software beyond Ollama and LiteLLM.

Key Points

1. Zenity Labs' honeypots caught three attackers hijacking exposed Ollama and LiteLLM endpoints to run their own offensive AI agents for free.
2. One operator aimed the autonomous pentest agent Strix at a live third-party auction site; a Codex-based operator leaked their own GitHub handle.
3. Root cause is insecure defaults: Ollama ships with no authentication and LiteLLM's proxy binds to all interfaces with authentication left optional.

Scoring Rationale

Concrete, honeypot-verified evidence that attackers are hijacking exposed AI infrastructure to run autonomous offensive agents against third parties, not just stealing compute, is a genuinely new and well-documented attack pattern relevant to any team running self-hosted Ollama or LiteLLM. The vendor-originated nature of the research (Zenity sells AI agent security products) is a caveat, but the technical detail (specific IPs, prompts, timestamps) is independently verifiable and consistent, keeping the score at the major-but-not-industry-shaking tier.


Sources

Primary source and supporting public references used for this report.

Primary source: martechseries.com, "Zenity Labs Reveals How Attackers are Weaponizing Enterprise AI Infrastructure"
