Two months after Claude Opus 4.6, Anthropic has now settled into a two-month upgrade cadence for its most capable generally available model. The pricing did not move. The model identifier in the API is claude-opus-4-7. The benchmarks moved a lot.
This is the story engineers will care about in the morning. The story Anthropic is telling between the lines, about a more powerful model it is still holding back, is the one regulators, enterprise buyers, and Apple are already reading.
The Coding Numbers Are the Reason to Upgrade
On SWE-bench Verified, Opus 4.7 resolved 87.6% of tasks. That is a jump from Opus 4.6's 80.8% and clears Gemini 3.1 Pro's 80.6%. On the harder SWE-bench Pro, the gap widens: Opus 4.7 posts 64.3%, up from 53.4% on Opus 4.6, beating GPT-5.4 at 57.7% and Gemini 3.1 Pro at 54.2%.
The internal numbers Anthropic published are the more interesting ones. On Rakuten's production SWE benchmark, Opus 4.7 resolves three times as many real engineering tasks as Opus 4.6. On a 93-task coding evaluation, the new model shows a 13% improvement over 4.6 and solves 4 tasks that neither Opus 4.6 nor Sonnet 4.6 could complete. On OfficeQA Pro, document reasoning errors fall by 21%.
| Benchmark | Opus 4.7 | Opus 4.6 | GPT-5.4 | Gemini 3.1 Pro |
|---|---|---|---|---|
| SWE-bench Verified | 87.6% | 80.8% | n/a | 80.6% |
| SWE-bench Pro | 64.3% | 53.4% | 57.7% | 54.2% |
| GPQA Diamond | 94.2% | n/a | 94.4% | 94.3% |
| MCP-Atlas (tool use) | 77.3% | n/a | 68.1% | 73.9% |
| MMMLU (multilingual) | 91.5% | n/a | n/a | 92.6% |
| BrowseComp (agentic search) | 79.3% | n/a | 89.3% | n/a |
The takeaways for practitioners are concrete. Opus 4.7 is now the strongest generally available model for agentic coding and scaled tool use. It is roughly tied with competitors on graduate-level reasoning. It gives up ground on multilingual Q&A to Gemini and loses meaningfully on agentic web search to GPT-5.4. For teams running long-horizon coding agents that call out to file systems, databases, and external APIs, the MCP-Atlas number and the SWE-bench Pro jump are the two data points that matter.
Anthropic's framing of the capability gain was unusually direct. The company said users are now handing off "their hardest coding work, the kind that previously needed close supervision" to Opus 4.7 with confidence. A new xhigh effort level gives callers finer control over how much compute the model spends on a single turn. Task budgets, in public beta, let agents cap their own cost before they run away.
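For teams wiring these controls into an agent, the request shape might look like the sketch below. Only the model identifier comes from the launch notes; the parameter names `effort` and `task_budget_tokens` are assumptions for illustration, since Anthropic's notes name the features but not their exact API surface.

```python
# Sketch of a Messages API request payload pinned to the new model.
# "effort" and "task_budget_tokens" are ASSUMED parameter names --
# the launch notes describe the features, not the request schema.

def build_request(prompt: str) -> dict:
    """Assemble a request payload; sending it requires the Anthropic SDK."""
    return {
        "model": "claude-opus-4-7",     # identifier from the launch notes
        "max_tokens": 4096,
        "effort": "xhigh",              # assumed name for the new effort level
        "task_budget_tokens": 200_000,  # assumed name for the beta task-budget cap
        "messages": [{"role": "user", "content": prompt}],
    }

req = build_request("Refactor the retry logic in worker.py")
print(req["model"])  # claude-opus-4-7
```

The point of the budget cap is the last clause of the paragraph above: the agent's spend is bounded before the loop starts, not audited after it runs away.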
Worth noting: Opus 4.7's updated tokenizer means the same prompt consumes between 1.0x and 1.35x the tokens it did on Opus 4.6, according to Anthropic's developer notes, so up to 35% more in the worst case. Teams with fixed budgets should recalibrate.
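The recalibration is simple arithmetic: scale the existing per-request budget by the worst-case inflation factor. A minimal sketch, where the 1.35 factor comes from the developer notes and the budget figure is a made-up example:

```python
# Recompute a fixed token budget for the reported 1.0x-1.35x tokenizer
# inflation on Opus 4.7. Provisioning for the worst case keeps prompts
# that fit on Opus 4.6 from being truncated after migration.
import math

OPUS_47_INFLATION_MAX = 1.35  # worst-case tokens per prompt vs. Opus 4.6

def rescaled_budget(opus_46_budget_tokens: int,
                    inflation: float = OPUS_47_INFLATION_MAX) -> int:
    """Tokens to provision on Opus 4.7 for the same prompt workload."""
    return math.ceil(opus_46_budget_tokens * inflation)

print(rescaled_budget(100_000))  # 135000
```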
Vision Got a Surprise Upgrade
The quieter technical update is on the vision side. Opus 4.7 accepts images up to 2,576 pixels on the long edge, roughly 3.75 megapixels. That is more than three times the resolution ceiling of Opus 4.6. On Anthropic's internal visual acuity benchmark, the model jumped from 54.5% on Opus 4.6 to 98.5% on Opus 4.7.
For engineers who have been piping screenshots of dashboards, architecture diagrams, chemistry structures, or dense financial documents into Claude and watching it miss small labels, this is a meaningful change. A 3.75 megapixel input is enough to capture a reasonable portion of a business spreadsheet without downsampling, which is exactly where Claude has historically struggled.
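The practical pre-processing change is that screenshots now only need to be scaled down when their long edge exceeds the new ceiling. A minimal sketch, assuming the 2,576-pixel limit from Anthropic's announcement (the helper itself is illustrative):

```python
# Compute dimensions that fit an image under the reported 2,576-pixel
# long-edge ceiling while preserving aspect ratio. Images already
# within the limit pass through untouched.

LONG_EDGE_LIMIT = 2576  # Opus 4.7's reported long-edge ceiling, in pixels

def fit_to_limit(width: int, height: int,
                 limit: int = LONG_EDGE_LIMIT) -> tuple[int, int]:
    """Return (width, height) scaled so the long edge is at most `limit`."""
    long_edge = max(width, height)
    if long_edge <= limit:
        return width, height  # within the ceiling; no resampling needed
    scale = limit / long_edge
    return round(width * scale), round(height * scale)

print(fit_to_limit(3840, 2160))  # a 4K screenshot lands at (2576, 1449)
```

Under Opus 4.6's lower ceiling, that 4K dashboard screenshot would have been downsampled much more aggressively, which is where the small labels were getting lost.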
Mythos Is the Model in the Room
The line Anthropic chose not to hide is the one the press noticed first. Opus 4.7, Anthropic acknowledged publicly on Thursday, is not the company's most powerful model. It is the most powerful model the company will let most customers run.
The actual frontier model at Anthropic right now is Claude Mythos, revealed last week in the Project Glasswing briefing. In Anthropic's own red-team testing, Mythos autonomously found and exploited previously unknown vulnerabilities across every major operating system and browser it was tested on. Anthropic pulled broad distribution and routed Mythos through a controlled-access program. Apple is among the handful of "key platform vendors" still permitted to run it.
Opus 4.7 is the commercial counterweight. Anthropic said Thursday that the model ships with "automated safeguards that detect and block requests indicating prohibited or high-risk cybersecurity uses" and a new Cyber Verification Program that grants expanded access to credentialed security professionals. The cyber capabilities were intentionally dialed back relative to Mythos.
The contrast is the business model. A company that has told regulators it can build a model capable of autonomous zero-day discovery is now selling a slightly-less-capable model at the same price point it charged two months ago. The buyers absorb the safety delta. Apple, for reasons Anthropic has not fully explained, does not.
OpenAI Countered Two Days Earlier
Opus 4.7 did not land in a vacuum. On Tuesday, April 14, OpenAI released GPT-5.4-Cyber with an expanded Trusted Access for Cyber program, its own answer to the Mythos problem. OpenAI's bet is that credentialing thousands of security researchers produces a better defensive posture than restricting the model to a handful of named vendors. Anthropic's bet, visible in Thursday's launch, is that it can sell a weaker model to the market and a stronger one to Apple while neither side of that deal blinks.
The commercial context matters. Anthropic's annualized revenue reached roughly 30 billion dollars in April, surpassing OpenAI's 25 billion for the first time, up from 9 billion at the end of 2025. That math only works if Opus 4.7 is good enough to keep enterprises on the Claude platform while Mythos remains off the menu. Thursday's benchmarks are the answer to the question Anthropic's finance team needed answered.
The Other Side: Where Opus 4.7 Still Loses
The Opus 4.7 numbers are not a clean sweep, and Anthropic did not pretend they were. Three gaps stand out.
On BrowseComp, the Scale AI benchmark for agentic web browsing, GPT-5.4 scored 89.3% against Opus 4.7's 79.3%. If the job is dispatching an agent to crawl the open web, read sites, and chain searches, OpenAI still has the better tool. On MMMLU, Gemini 3.1 Pro's 92.6% edges Opus 4.7's 91.5% for multilingual question answering. On GPQA Diamond, GPT-5.4 Pro's 94.4% is a hair above Opus 4.7's 94.2% and Gemini's 94.3%, a three-way tie within noise that undercuts any claim of reasoning superiority.
And then there is the question that did not exist two months ago. If Opus 4.7 is roughly on par with the OpenAI and Google flagships on paper, and Anthropic has already shown that Mythos is qualitatively more capable, then the model the market can actually buy is not the frontier. The frontier is sitting inside Apple's developer tooling and a small controlled-access group. For security teams, infrastructure buyers, and enterprise ML leads, the ceiling of what you can procure right now is visibly lower than what exists.
Anthropic's line on that question has been consistent: the goal of restricting Mythos is to learn how to deploy "Mythos-class models at scale" safely over time. The counterargument, offered mostly by independent AI policy analysts on Thursday, is simpler. A controlled-access frontier model is a competitive moat dressed as a safety posture. Both things can be true at once.
The Bottom Line
Anthropic shipped a better coding model on Thursday, held the price flat, and put it inside four cloud providers and GitHub Copilot on the same day. For teams that have been running agentic coding pipelines against Opus 4.6, the move to 4.7 is mechanical: change the identifier and recompute the token budget. The SWE-bench Verified jump from 80.8% to 87.6% is the kind of gain that changes which tasks an engineering team can ship to an agent without human review.
The deeper story is about what Anthropic will not sell you. Opus 4.7 is the commercial ceiling. Mythos is the real ceiling, and it lives behind a vendor approval process. Apple is inside the room. OpenAI chose to meet this by credentialing thousands of security researchers two days earlier. Anthropic chose to meet it by releasing a strong-but-tempered model at a familiar price and betting the market will accept the deal. Thursday's launch is the first data point on whether that bet holds.
As Anthropic put it in its developer notes, users are handing off their hardest coding work to Opus 4.7 with confidence. Whether Mythos ever joins them in production is a different question, and the one the industry will be watching through the summer.
Sources
- Introducing Claude Opus 4.7 (Anthropic, April 16, 2026)
- Claude Opus 4.7 is now available in Amazon Bedrock (AWS, April 16, 2026)
- Claude Opus 4.7 is generally available (GitHub Changelog, April 16, 2026)
- Anthropic reveals new Opus 4.7 model with focus on advanced software engineering (9to5Mac, April 16, 2026)
- Anthropic launches Claude Opus 4.7 with enhanced coding and vision capabilities (Yahoo Finance, April 16, 2026)
- Anthropic Releases Claude Opus 4.7, Beats GPT-5.4, Gemini 3.1 Pro On Most Benchmarks (OfficeChai, April 16, 2026)
- Claude Opus 4.7 leads on SWE-bench and agentic reasoning (The Next Web, April 16, 2026)
- Anthropic Releases AI Model With Weaker Cyber Skills Than Mythos (Bloomberg, April 16, 2026)
- Anthropic rolls out Claude Opus 4.7, an AI model that is less risky than Mythos (CNBC, April 16, 2026)
- Anthropic releases Claude Opus 4.7, concedes it trails unreleased Mythos (Axios, April 16, 2026)
- Exclusive: Anthropic Preps Opus 4.7 Model, AI Design Tool (The Information, April 15, 2026)