Microsoft's MDASH Finds 16 Windows Vulnerabilities

May 14, 2026 | By LDS Team

Relevance Score: 8.0

Microsoft announced a new multi-model, agentic vulnerability discovery system codenamed MDASH that helped uncover 16 previously unknown Windows vulnerabilities, including four critical remote code execution (RCE) flaws, according to Microsoft's security blog and reporting by CSO and SiliconANGLE. The flaws, which affected components including tcpip.sys, IKEEXT, netlogon.dll, dnsapi.dll, and http.sys, were fixed in Microsoft's May 12 Patch Tuesday updates, per CSO. Microsoft also published benchmark results showing MDASH scored 88.45% on the public CyberGym benchmark and detected all 21 planted issues in a private test driver with zero false positives, according to Microsoft's blog and SiliconANGLE. Industry observers say this is an early signal that agentic, multi-model systems can accelerate vulnerability discovery and increase patch volume, changing how security teams prioritize testing and deployment.

For security and platform teams, the notable signal is not just the 16 patched bugs. It is that Microsoft is now running an agentic, multi-model system at production scale against its own codebase and publicly benchmarking it against rivals. That shifts vulnerability discovery from a linear, headcount-bound process toward one where scanning throughput and benchmark performance become vendor differentiators, and where defenders should expect faster, larger patch batches as more vendors adopt similar pipelines.

What happened

Microsoft announced a new multi-model, agentic vulnerability discovery system codenamed MDASH, and the system contributed to identifying 16 previously unknown Windows vulnerabilities, including four critical remote code execution flaws, according to Microsoft's security blog and coverage in CSO and SiliconANGLE. The vulnerabilities were patched in Microsoft's May 12 Patch Tuesday release, and affected components named in reporting include tcpip.sys, IKEEXT, netlogon.dll, dnsapi.dll, and http.sys, per CSO. Microsoft says MDASH will open to enterprise customers in private preview in June.

Technical context

Per Microsoft's blog, MDASH is a "multi-model agentic scanning harness" that orchestrates more than 100 specialized AI agents across an ensemble of frontier and distilled models to find, validate, and reproduce exploitable bugs. Microsoft published internal benchmark results showing MDASH achieved 100% detection on a private test driver called StorageDrive with 21 planted vulnerabilities and zero false positives, and reported 96% recall on clfs.sys and 100% recall on tcpip.sys across five years of confirmed Microsoft Security Response Center cases, per SiliconANGLE and Microsoft's announcement. On the public CyberGym benchmark, Microsoft reported a score of 88.45%, which several outlets, including Neowin and GeekWire, described as topping Anthropic's Mythos security-research system on the same leaderboard.
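As a reading aid for the detection figures above, the following minimal Python sketch shows how recall and precision are conventionally computed, using the reported StorageDrive numbers (21 planted vulnerabilities, all detected, zero false positives). The function and variable names are illustrative assumptions, and nothing here reflects how Microsoft actually scores MDASH.

```python
def recall(true_positives: int, false_negatives: int) -> float:
    """Share of real vulnerabilities that the scanner found."""
    total_real = true_positives + false_negatives
    if total_real == 0:
        raise ValueError("recall is undefined when there are no real vulnerabilities")
    return true_positives / total_real


def precision(true_positives: int, false_positives: int) -> float:
    """Share of reported findings that were real vulnerabilities."""
    total_reported = true_positives + false_positives
    if total_reported == 0:
        raise ValueError("precision is undefined when nothing was reported")
    return true_positives / total_reported


# Figures reported for the private StorageDrive test driver.
planted = 21
detected = 21
false_positives = 0

storage_recall = recall(detected, planted - detected)
storage_precision = precision(detected, false_positives)
print(f"recall={storage_recall:.2%} precision={storage_precision:.2%}")
```

The percentage recall figures for clfs.sys and tcpip.sys were published without the underlying case counts, so they cannot be recomputed this way from the reporting alone.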

Industry context

The disclosure landed alongside other vendors' AI-assisted bug hunts: SecurityWeek reported that Palo Alto Networks used Anthropic's Mythos system to find dozens of flaws in its own code, and The Register described a broader "vulnpocalypse" pattern in which Palo Alto fixed 75 flaws this month versus its usual five. CSO and SiliconANGLE also named two of the higher-severity CVEs MDASH surfaced: CVE-2026-33827, a remote use-after-free in the IPv4 stack, and CVE-2026-33824, a double-free in IKEEXT. For enterprise defenders, public disclosure of exploitable, network-reachable component bugs raises the operational importance of rapid patching and staged validation, and industry-wide adoption of agentic scanning is likely to keep increasing both the volume and technical complexity of vendor disclosures.

For practitioners

Enrich testing pipelines with automated validation steps that can reproduce vendor-provided exploit proofs, and track vendor-disclosed CVEs such as CVE-2026-33827 and CVE-2026-33824 through standard feeds. As AI-driven discovery increases the volume of findings across vendors, teams should evaluate automation for triage, regression testing, and staged rollouts rather than relying solely on manual review.
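One way to start on the tracking step is a small watchlist check, sketched below in Python under stated assumptions. The feed is modeled as a plain list of ID strings, and the `triage` helper, the sample feed, and the placeholder ID `CVE-0000-00000` are hypothetical. Only the two watchlisted CVE IDs come from the reporting. A real pipeline would read from whatever advisory feed the team already consumes.

```python
import re

# Standard CVE ID syntax: "CVE-", a four-digit year, then four or more digits.
CVE_ID_PATTERN = re.compile(r"CVE-\d{4}-\d{4,}")


def is_valid_cve_id(cve_id: str) -> bool:
    """Return True when the whole string is a well-formed CVE ID."""
    return CVE_ID_PATTERN.fullmatch(cve_id) is not None


def triage(feed_ids, watchlist):
    """Split feed entries into tracked and other IDs, and list watchlist gaps.

    Malformed entries are dropped. Feed order is preserved within each group.
    """
    valid = [cve for cve in feed_ids if is_valid_cve_id(cve)]
    tracked = [cve for cve in valid if cve in watchlist]
    other = [cve for cve in valid if cve not in watchlist]
    missing = sorted(watchlist - set(valid))  # watchlisted IDs not yet seen
    return tracked, other, missing


# The two CVEs named in the reporting.
WATCHLIST = {"CVE-2026-33827", "CVE-2026-33824"}

# Hypothetical feed contents, for illustration only.
sample_feed = ["CVE-2026-33827", "CVE-0000-00000", "not-a-cve"]

tracked, other, missing = triage(sample_feed, WATCHLIST)
print("tracked:", tracked)
print("other:", other)
print("not yet seen:", missing)
```

Reporting the watchlisted IDs that have not yet appeared makes gaps visible, which matters more as the volume of vendor disclosures grows.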

What to watch

Watch for independent, third-party validation of Microsoft's benchmark claims, including replications on public datasets and audits of false-positive and false-negative rates, since the CyberGym score is currently a vendor-reported comparative metric. Also watch whether other vendors publish similar agentic scanning systems, whether third-party security teams adopt multi-agent pipelines of their own, and whether any exploit attempts against the newly patched CVEs emerge that would change patch-prioritization guidance.

Key Points

1. Microsoft's agentic system MDASH helped find 16 Windows vulnerabilities, including four critical RCEs, per Microsoft and CSO.
2. MDASH uses over 100 specialized AI agents and scored 88.45% on the CyberGym benchmark, ahead of Anthropic's Mythos per several outlets.
3. Industry-wide adoption of AI-driven bug hunting is raising both the volume and complexity of vendor disclosures, per SecurityWeek and The Register.

Scoring Rationale

A major vendor (Microsoft) deployed an agentic, multi-model system in production against its own codebase, surfacing 16 real vulnerabilities including 4 critical RCEs with assigned CVEs, corroborated by Microsoft's own blog and seven-plus independent outlets, and cross-referenced against a parallel disclosure from Palo Alto Networks using Anthropic's Mythos. This has direct, immediate operational impact for security and patch-management teams industry-wide.

Sources

Primary source and supporting public references used for this report.

7 sources

Primary source: anoopcnair.com, "16 New Windows Vulnerabilities Discovered By Microsoft’s AI-Powered Agentic Security System" (HTMD Blog)
