Claude Opus 4.5 is completely insane

Woah this is incredible.

Anthropic just released the new Claude Opus 4.5 -- and it's better than every other coding model at basically everything.

Just look at the insane difference between Opus 4.5 and Sonnet 4.5 in solving this complex puzzle game:

Many devs online have been calling it the greatest coding model ever -- not hard to believe when you see how it stacks up to the other models:

It even beats Gemini 3 Pro that just came out like a week ago:

This is a model built from the ground up to be an agentic software engineer: fixing bugs, refactoring large codebases, navigating unfamiliar repos, and wiring everything together with tools and terminals.

Opus 4.5 isn’t just competitive — it’s designed to be the thing you reach for when failure is expensive.

80% on the SWE-bench verified benchmark is the highest ever any model has ever gotten.

And this SWE-bench Verified is a benchmark where models must actually apply patches that pass tests in real GitHub repos. It's the sort of test where you’re not answering quiz questions -- you’re actually modifying real-world Python projects and passing every single written test in the codebase.

Anthropic also ran it on their two-hour engineering hiring exam and reported that Opus 4.5, under realistic constraints, scored higher than any human candidate they’ve evaluated -- though with the important caveat that it was allowed multiple runs and they picked the best.

You can see that Opus 4.5 is optimized for “here’s a repo, make it work,” not just “explain what a binary search tree is.”

This is advanced software engineering for messy real-world tasks -- far more than just "build a todo list app".

The effort knob: turning up (or down) the brainpower

The most interesting feature for coders is the effort parameter -- exclusive to Opus 4.5 for now.

Instead of swapping between…

Deepgram interviewed 400 senior leaders on voice AI adoption: 97% already use it, 84% will increase budgets, yet only 21% are very satisfied with legacy agents. See where enterprises deploy human-like voice AI agents - customer service, task automation, order capture. Benchmark your roadmap against $100M peers for 2026 priorities.

Download the Report

What 100K+ Engineers Read to Stay Ahead

Your GitHub stars won't save you if you're behind on tech trends.

That's why over 100K engineers read The Code to spot what's coming next.

Get curated tech news, tools, and insights twice a week
Learn about emerging trends you can leverage at work in just 10 mins
Become the engineer who always knows what's next

Join 100k+ engineers

Claude Opus 4.5 is completely insane

Claude Opus 4.5 is completely insane

The effort knob: turning up (or down) the brainpower

Voice AI: Get the Proof. Avoid the Hype.

What 100K+ Engineers Read to Stay Ahead

Keep Reading

Coding Beauty