Videos
Today, Anthropic is introducing the next generation of Claude models: Claude Opus 4 and Claude Sonnet 4, setting new standards for coding, advanced reasoning, and AI agents. Claude Opus 4 is the world’s best coding model, with sustained performance on complex, long-running tasks and agent workflows. Claude Sonnet 4 is a drop-in replacement for Claude Sonnet 3.7, delivering superior coding and reasoning while responding more precisely to your instructions.
Claude Opus 4 and Sonnet 4 are hybrid models offering two modes: near-instant responses and extended thinking for deeper reasoning. Both models can also alternate between reasoning and tool use—like web search—to improve responses.
Both Claude 4 models are available today for all paid plans. Additionally, Claude Sonnet 4 is available on the free plan.
Read more here: https://www.anthropic.com/news/claude-4
I have been extensively using Gemini 2.5 Pro for coding-related stuff and O3 for everything else, and it's crazy that within a month or so, they look kind of obsolete. Claude Opus 4 is the best overall model available right now.
I ran a quick coding test, Opus against Gemini 2.5 Pro and OpenAI o3. The intention was to create visually appealing and bug-free code.
Here are my observations
-
Claude Opus 4 leads in raw performance and prompt adherence.
-
It understands user intentions better, reminiscent of 3.6 Sonnet.
-
High taste. The generated outputs are tasteful. Retains the Opus 3 personality to an extent.
-
Though unrelated to code, the model feels nice, and I never enjoyed talking to Gemini and o3.
-
Gemini 2.5 is more affordable in pricing and takes much fewer API credits than Opus.
-
One million context length in Gemini is undefeatable for large codebase understanding.
-
Opus is the slowest in time to first token. You have to be patient with the thinking mode.
Check out the blog post for complete comparison analysis with codes: Claude 4 Opus vs. Gemini 2.5 vs. OpenAI o3
The vibes with Opus are the best; it has a personality and is stupidly capable. But too pricey; it's best used with the Claude app, the API cost will put a hole in your pocket. Gemini will always be your friend with free access and the cheapest SOTA model.
Would love to know your experience with Claude 4 Opus and how you would compare it with o3 and Gemini 2.5 pro in coding and non-coding tasks.