
Wow I've never seen Sonnet do something like this before. This is huge.
You absolutely cannot ignore this.
I don't even need to compare it to GPT or Gemini or whatever.
Claude Sonnet is actually no longer trying to be a nice little tradeoff between intelligence and cost.
This new Claude Sonnet is here to be a MASSIVE CHALLENGER to its big brother Claude Opus.

And from the numbers I'm seeing, it has made dangerous progress toward achieving that with this new 4.6 update.
It decimated the previous version of Claude Opus (4.5) in basically every metric -- and was incredibly close to the current Opus version -- and even beat this latest Opus in notable areas.
Literally 2nd position in the biggest AI benchmarks out there -- and guess the one model that stopped it from gaining top spot?

It's gotten so much better at automating actions on your computer now (Computer Use):

1 MILLION token context -- trust me this is not a model you want to mess around with.
With Sonnet 4.6, Claude will handle all your real-world, production AI workloads — especially coding and tool use — without the higher cost of Opus.
1. Essential coding upgrade — that we will all feel
Sonnet 4.6 scored 79.6% on SWE-bench Verified, extremely close to Opus 4.6’s ~80.8%, showing near-flagship coding performance at lower cost.
And not just benchmarks. Sonnet 4.6 is here to work with us in real workflows:
Understanding large repos
Editing across multiple files
Avoiding unnecessary rewrites
Following existing structure instead of “overengineering”
In Anthropic’s own testing, developers preferred Sonnet 4.6 over Sonnet 4.5 about 70% of the time in Claude Code, citing better context reading and less duplication/overengineering.

2. Unbelievable Computer Use gains
Anthropic has been massively pushing Computer Use lately: the AI models controlling out software like we would to carry out complex actions for us — clicking, typing, navigating interfaces along the way.
With 4.6, that capability improved significantly.
Sonnet 4.6 achieved 72.5% on OSWorld-Verified, dramatically up from Sonnet 4.5’s ~61.4% and nearly matching Opus 4.6’s ~72.7%, which demonstrates near-parity in practical interface interaction tasks.
Sonnet 4.6 now performs nearly on par with Opus in Computer Use.
That’s a big deal because computer-use tasks are messy. They require:
Reading dynamic UI elements
Recovering from small mistakes
Planning multi-step actions
It’s not perfect, but it’s much closer to “practical assistant” than previous versions.

3. 1M is serious business
Better prompts. Better AI output.
AI gets smarter when your input is complete. Wispr Flow helps you think out loud and capture full context by voice, then turns that speech into a clean, structured prompt you can paste into ChatGPT, Claude, or any assistant. No more chopping up thoughts into typed paragraphs. Preserve constraints, examples, edge cases, and tone by speaking them once. The result is faster iteration, more precise outputs, and less time re-prompting. Try Wispr Flow for AI or see a 30-second demo.
What 100K+ Engineers Read to Stay Ahead
Your GitHub stars won't save you if you're behind on tech trends.
That's why over 100K engineers read The Code to spot what's coming next.
Get curated tech news, tools, and insights twice a week
Learn about emerging trends you can leverage at work in just 10 mins
Become the engineer who always knows what's next



