Researchllmexploit developmentoffensive security

LLM Agents Generate QuickJS Exploit Chains

|January 18, 2026|By LDS Team

8.6

Relevance Score

LLM Agents Generate QuickJS Exploit Chains

A researcher ran experiments using Opus 4.5 and GPT-5.2 that produced over 40 distinct exploits for a zero-day QuickJS vulnerability across six scenarios, with code and write-ups published on GitHub. GPT-5.2 solved every scenario and Opus 4.5 solved all but two, with typical runs limited to 30M tokens (about $30) and the hardest task requiring ~50M tokens and three hours, implying token throughput could industrialize offensive cyber capabilities.

Key Points

1Show agents produced 40+ QuickJS exploits across six scenarios using Opus 4.5 and GPT-5.2.
2Indicate LLMs can search complex exploit spaces and autonomously verify solutions with fast feedback.
3Imply intrusion could industrialize, making token throughput the limiting factor for offensive cyber operations.

Scoring Rationale

High practical novelty and broad industry impact, tempered by single-source, non-peer-reviewed experimental evidence and limited replication.

Sources

Public references used for this report.

1 source

01sean.heelan.ioOn the Coming Industrialisation of Exploit Generation with LLMs

Practice with real Logistics & Shipping data

90 SQL & Python problems · 15 industry datasets

Used by DS/ML engineers at top companies

High-Value Overnight OrdersEasy

Delivered International ShipmentsMedium

On-Time Delivery Rate by CarrierHard

250 free problems · No credit card

See all Logistics & Shipping problems

Researchllmexploit developmentoffensive security

LLM Agents Generate QuickJS Exploit Chains

|January 18, 2026|By LDS Team

8.6

Relevance Score

Key Points

1Show agents produced 40+ QuickJS exploits across six scenarios using Opus 4.5 and GPT-5.2.
2Indicate LLMs can search complex exploit spaces and autonomously verify solutions with fast feedback.
3Imply intrusion could industrialize, making token throughput the limiting factor for offensive cyber operations.

Scoring Rationale

High practical novelty and broad industry impact, tempered by single-source, non-peer-reviewed experimental evidence and limited replication.

Sources

Public references used for this report.

1 source

01sean.heelan.ioOn the Coming Industrialisation of Exploit Generation with LLMs

Practice with real Logistics & Shipping data

90 SQL & Python problems · 15 industry datasets

Used by DS/ML engineers at top companies

High-Value Overnight OrdersEasy

Delivered International ShipmentsMedium

On-Time Delivery Rate by CarrierHard

250 free problems · No credit card

See all Logistics & Shipping problems

LLM Agents Generate QuickJS Exploit Chains

Key Points

Scoring Rationale

Sources

More AI & Data Science News

White House Adviser Rules Out FDA-Style AI Regulator

Consumers Pay More as Electronics Component Prices Rise

Anthropic launches Claude Science for drug discovery

Safari Enables AI Debugging With New MCP Server

LLM Agents Generate QuickJS Exploit Chains

Key Points

Scoring Rationale

Sources

More AI & Data Science News

White House Adviser Rules Out FDA-Style AI Regulator

Consumers Pay More as Electronics Component Prices Rise

Anthropic launches Claude Science for drug discovery

Safari Enables AI Debugging With New MCP Server