Security & Riskcybersecuritysecuritymythoschina

Z.ai Matches Mythos on Cybersecurity Bug-Finding

||By LDS Team
7.3
Relevance Score
Z.ai Matches Mythos on Cybersecurity Bug-Finding
Photo: The Verge · rights & takedowns

Open-weight access to vulnerability-finding AI at frontier levels restructures the security calculus for defenders and attackers alike. Semgrep benchmarked Zhipu AI's GLM-5.2 against leading models on IDOR detection in June 2026, finding it scored 39% F1 - beating Claude Code at 32% - while Graphistry's independent CyBT-CTF evaluation confirmed it matches Opus 4.8 on cybersecurity investigation tasks. Unlike Anthropic's Fable 5 and Mythos - export-controlled by the US government on June 12, 2026 - GLM-5.2 is open-weight under an MIT license: anyone can download, fine-tune, or strip safety controls from it. Axios reported Russian-language hacker forums were already sharing jailbreak techniques within days of the open-weight release. Graphistry also raised distillation allegations, noting statistical output patterns consistent with training on Opus 4.8 and GPT-5.5 outputs without permission. The Wall Street Journal and The Verge covered the story as a meaningful narrowing of the China-US gap in security-relevant AI. For practitioners, the risk shift is structural: export controls on closed frontier models now coexist with an open-weight alternative approaching the same performance ceiling.

Practitioner takeaway

When an open-weight model reaches frontier-adjacent performance on vulnerability discovery with no access controls or gating, the practical threat model for security teams changes immediately. Unlike Anthropic's Mythos - which sits behind subscription gates, geographic restrictions, and US export controls enacted June 12, 2026 - GLM-5.2 runs locally under an MIT license with safety controls that can be removed, fine-tuned away, or replaced. This compresses the operational timeline for both defensive tooling and offensive exploitation, and it forces an update to any threat model that relied on access friction to constrain capability.

What the benchmarks show Two independent security evaluations provide the primary evidence for the parity claim. Semgrep, a security tooling company, published benchmark results on June 22, 2026, comparing models on IDOR (Insecure Direct Object Reference) detection. GLM-5.2 scored 39% F1, ahead of Claude Code (32%), though still below Semgrep's own multimodal pipeline (53-61% F1). Graphistry ran a separate evaluation on the CyBT-CTF benchmark - a capture-the-flag evaluation set used by security researchers - and found GLM-5.2 matched Opus 4.8 on solve rate, making it the first open-weight model Graphistry said it would recommend for a "frontier-like" cybersecurity experience. The Wall Street Journal first brought these evaluations to a broad audience, describing the results as a meaningful narrowing of the US-China gap in security-relevant model capabilities.

Distillation concern Graphistry researchers flagged a statistical anomaly that may help explain the rapid capability gain: GLM-5.2's outputs correlated unusually highly with both GPT-5.5 and Opus 4.8 responses on identical prompts, with Cohen's Kappa values of 0.80 and 0.76 respectively, against a baseline of 0.63 between the two US models. Graphistry described this pattern as consistent with knowledge distillation - a technique where a model is trained on the outputs of a larger proprietary one, violating the terms of service of both Anthropic and OpenAI when done without permission. Zhipu AI has not confirmed or denied this characterisation. If accurate, it implies the model's security-task gains may be built on access to gated capability, which carries potential IP and regulatory implications beyond the current export-control debate.

Exploitation in the wild Axios reported on June 25 that Russian-language hacker forums were already circulating jailbreak techniques for GLM-5.2 within days of its open-weight release, with threat actors discussing use for generating phishing emails, fraud scripts, and vulnerability-specific payloads. The Five Eyes security alliance reportedly circulated internal warnings about the model's capability profile. This is not a theoretical future risk: it reflects how quickly open-weight releases propagate into active exploit communities once parameters are publicly downloadable.

Technical context

Vulnerability-finding is a narrowly scoped, high-signal task compared with general reasoning benchmarks. Models tuned or prompted for code analysis, static-diff scanning, or exploit-pattern recognition can achieve substantial gains on security-specific datasets without matching general multimodal or reasoning capabilities. GLM-5.2 is a 744-billion-parameter Mixture-of-Experts model with a 1-million-token context window, released June 13, 2026 - one day after US export controls blocked foreign access to Fable 5 and Mythos. The open-weight distribution also lowers friction for iterative prompt engineering, fine-tuning on private corpora, and large-batch evaluations, each of which can accelerate niche task improvement.

Policy and geopolitical context

The timing of GLM-5.2's open-weight release - the day after US export controls targeted Mythos and Fable 5 - was widely noted as strategically significant in coverage by Axios, The Verge, and others. Export controls on closed-source frontier models create asymmetric availability: they restrict legitimate foreign academic and enterprise users while providing minimal friction to those willing to use open-weight alternatives without geographic restrictions. This tension in the current US AI policy framework has drawn commentary from AI policy observers and was central to the WSJ framing of the story as "resetting the AI race."

What to watch

  • Independent replication of the Semgrep and Graphistry benchmarks on real-world vulnerability corpora outside controlled research settings.
  • Whether Anthropic or OpenAI pursue action over the distillation allegations Graphistry raised.
  • How cloud providers, package managers, and CI/CD security tooling adapt ingestion and monitoring for locally-run open-weight models.
  • Policy responses from Five Eyes and other security bodies regarding open-weight models with frontier-adjacent cybersecurity capabilities.

Bottom line

The reported results represent a meaningful narrowing of a specific capability gap, not general-purpose parity. For security practitioners, the immediate actions are: verify the benchmarks independently, update threat models to assume widespread access to Opus-level vulnerability-finding tooling without oversight, and audit detection pipelines for adversarial inputs from unconstrained fine-tuned variants.

Key Points

  • 1Semgrep's June 2026 benchmark found GLM-5.2 scored 39% F1 on IDOR detection, beating Claude Code; Graphistry independently confirmed Opus-level performance on CyBT-CTF.
  • 2GLM-5.2 launched under MIT license the day after US export controls blocked Mythos and Fable 5, removing access restrictions on frontier-adjacent security capability.
  • 3Russian-language hacker forums already exploit GLM-5.2 jailbreaks; Graphistry raised distillation allegations; defenders need updated threat models and detection pipelines.

Scoring Rationale

Two independent security evaluations (Semgrep IDOR benchmark, Graphistry CyBT-CTF) confirm GLM-5.2 reaches frontier-adjacent vulnerability-finding performance while distributing as open-weight under MIT license, removing the access friction that partially constrained earlier capability gains. Active exploitation in hacker forums within days of release and Five Eyes warnings indicate real-world risk, while Graphistry's distillation allegations add IP and policy dimensions. Significant for security practitioners and policy observers but scoped to a specific task domain; not a general-purpose capability shift.

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems