Researchagentspenetration testingsecurity automation

AI Agent Outperforms Human Penetration Testers

|December 12, 2025|By LDS Team

10.0

Relevance Score

AI Agent Outperforms Human Penetration Testers — Photo: i.insider.com · rights & takedowns

Stanford researchers published a study showing their AI agent ARTEMIS crawled the university's roughly 8,000-device computer science network for 16 hours and, in a 10-hour comparison, found nine valid vulnerabilities with an 82% validity rate, placing second among 10 professional testers. ARTEMIS runs at about $18 an hour (advanced $59), spawns sub-agents to investigate targets concurrently, and could lower penetration-testing costs while amplifying automated hacking risks.

Key Points

1Demonstrates ARTEMIS found nine valid vulnerabilities in 10-hour window on ~8,000-device Stanford network
2Shows AI spawns sub-agents to parallelize investigations, enabling broader scanning than single human testers
3Suggests organizations can lower testing costs but must address false positives and GUI interaction blind spots

Scoring Rationale

Robust Stanford experiment demonstrating cost-effective automation, though limited by GUI interaction failures and constrained participant/sample scope.

MoreAgentic AI news

Sources

Public references used for this report.

1 source

01businessinsider.comAn AI agent spent 16 hours hacking Stanford's network. It outperformed human pros for much less than their six-figure salaries.

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

Researchagentspenetration testingsecurity automation

AI Agent Outperforms Human Penetration Testers

|December 12, 2025|By LDS Team

10.0

Relevance Score

Key Points

1Demonstrates ARTEMIS found nine valid vulnerabilities in 10-hour window on ~8,000-device Stanford network
2Shows AI spawns sub-agents to parallelize investigations, enabling broader scanning than single human testers
3Suggests organizations can lower testing costs but must address false positives and GUI interaction blind spots

Scoring Rationale

Robust Stanford experiment demonstrating cost-effective automation, though limited by GUI interaction failures and constrained participant/sample scope.

MoreAgentic AI news

Sources

Public references used for this report.

1 source

01businessinsider.comAn AI agent spent 16 hours hacking Stanford's network. It outperformed human pros for much less than their six-figure salaries.

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

AI Agent Outperforms Human Penetration Testers

Key Points

Scoring Rationale

Sources

More AI & Data Science News

A.I. Data Centers Drive Growing Environmental Impact

Peter Thiel Warns Anthropic Could Influence 2028 Election

Tokens Define Model Cost and Context Limits

Saskatchewan Startups Adopt AI To Accelerate Growth

AI Agent Outperforms Human Penetration Testers

Key Points

Scoring Rationale

Sources

More AI & Data Science News

A.I. Data Centers Drive Growing Environmental Impact

Peter Thiel Warns Anthropic Could Influence 2028 Election

Tokens Define Model Cost and Context Limits

Saskatchewan Startups Adopt AI To Accelerate Growth