Nothing major. For me, Opus 4 was also great and did everything I asked for. For something to be truly great, it would have to do something I did not ask for, and that, by definition, isn't something I'm going to ask for. Answer from anantprsd5 on reddit.com
🌐
Bind AI IDE
blog.getbind.co › 2025 › 08 › 06 › claude-opus-4-1-vs-claude-opus-4-how-good-is-this-upgrade
Claude Opus 4.1 vs Claude Opus 4 – How good is this upgrade?
August 6, 2025 - Reliability: In practice, Opus 4.1 is more likely to both correctly identify and implement code changes needed to fix or enhance large, complex software projects. Scale: Handling multi-file edits, intricate dependencies, and significant codebase ...
🌐
Arsturn
arsturn.com › blog › is-claude-opus-4-1-worth-the-200-price-tag-a-deep-dive
Claude Opus 4.1 Review: Is It Worth the $200 Price?
The "Agent" Builder: If you're on the cutting edge, building AI agents that can perform complex, multi-step tasks autonomously, Opus 4.1 is a top contender. Its performance in agentic benchmarks & long-running tasks makes it one of the best platforms for this kind of work.
🌐
Data Studios
datastudios.org › post › claude-opus-4-1-reviews-what-experts-and-users-are-saying-about-anthropic-s-most-advanced-model
Claude Opus 4.1 reviews: what experts and users are saying about Anthropic’s most advanced model.
August 9, 2025 - Released as a direct response to the rapidly evolving competition from OpenAI and Google, Claude 4.1 consolidates its reputation with top-tier accuracy, tighter safety standards, and a more agentic approach to complex tasks.
🌐
Reddit
reddit.com › r/claudeai › genuinely impressed by opus 4.1
r/ClaudeAI on Reddit: Genuinely impressed by Opus 4.1
June 27, 2025 -

Been using Claude daily for development work and wanted to share some thoughts on the recent updates, especially after trying out Opus 4.1.

So I’ve been using Claude Code in strict mode for a while now, giving it precise instructions rather than just asking it to build entire features. This was working pretty well, but honestly I started feeling like Opus 4.0 was getting a bit worse over time, especially for planning work. Could’ve been in my head though.

When 4.1 dropped, I decided to actually test it on some complex stuff in a large codebase that I normally wouldn’t bother with. And damn… it actually crushed some really intricate problems. The solutions it came up with were genuinely impressive, not perfect, but as a senior engineer I was pretty surprised by the quality.

I keep seeing people complain about hitting limits too fast, but honestly I think it depends entirely on how you’re using it. If you dump a huge codebase on Opus and ask it to implement a whole feature, yeah, you’re gonna burn through your limits. But if you’re smart about it, it’s like having an amazing teammate.

I’m on the max plan (so maybe I’m biased here), but my current approach is to use Opus 4.1 for the high-level thinking - planning features, writing specs. Then I take those specs and hand them to Sonnet to actually implement. Sonnet just follows the plan and writes the code. Always review everything manually though, that’s still our job.

This way Opus handles the complex reasoning while Sonnet does the grunt work, and I’m not constantly hitting limits.

Honestly, when you use it right, Opus 4.1 feels like working with a really solid co-worker. Kudos to the Claude team - this update is legit! 👏

🌐
Anthropic
anthropic.com › claude › opus
Claude Opus 4.5
Claude Opus 4.1 is a drop-in replacement for Opus 4 that delivers superior performance and precision for real-world coding and agentic tasks.
🌐
Anthropic
anthropic.com › news › claude-opus-4-1
Claude Opus 4.1
Opus 4.1 advances our state-of-the-art coding performance to 74.5% on SWE-bench Verified. It also improves Claude’s in-depth research and data analysis skills, especially around detail tracking and agentic search.
🌐
Medium
medium.com › @leucopsis › claude-sonnet-4-and-opus-4-a-review-db68b004db90
Claude Sonnet 4 and Opus 4, a Review | by Barnacle Goose | Medium
May 29, 2025 - For instance, on the popular MMLU test (Massive Multitask Language Understanding, covering a wide range of subjects at college level), Claude Opus 4 scores around 87–89% accuracy — this is on par with or slightly above the original GPT-4 (which was ~86%) and just shy of OpenAI’s latest GPT-4.1 (which reportedly surpassed 90% on MMLU).
🌐
Glbgpt
glbgpt.com › resource › claude-opus-41-review-a-targeted-upgrade-for-coding-and-agentic-work
Claude Opus 4.1 review: a targeted upgrade for coding and agentic work
Anthropic says Opus 4.1 advances coding performance to 74.5% on SWE-bench Verified (their standard scaffold). Early customer quotes call out better multi-file refactors and fewer unnecessary edits. Agentic & terminal work. Multiple trackers report Terminal-Bench at ~43.3% (up from 39.2%), ...
🌐
Reddit
reddit.com › r/claudeai › claude 4 opus is the most tasteful coder among all the frontier models.
r/ClaudeAI on Reddit: Claude 4 Opus is the most tasteful coder among all the frontier models.
February 18, 2025 -

I have been extensively using Gemini 2.5 Pro for coding-related stuff and O3 for everything else, and it's crazy that within a month or so, they look kind of obsolete. Claude Opus 4 is the best overall model available right now.

I ran a quick coding test: Opus against Gemini 2.5 Pro and OpenAI o3. The intention was to create visually appealing and bug-free code.

Here are my observations:

  • Claude Opus 4 leads in raw performance and prompt adherence.

  • It understands user intentions better, reminiscent of 3.6 Sonnet.

  • High taste. The generated outputs are tasteful. Retains the Opus 3 personality to an extent.

  • Though unrelated to code, the model feels nice, and I never enjoyed talking to Gemini and o3.

  • Gemini 2.5 is more affordable and uses far fewer API credits than Opus.

  • Gemini's one-million-token context window is unbeatable for large-codebase understanding.

  • Opus is the slowest in time to first token. You have to be patient with the thinking mode.

Check out the blog post for the complete comparison analysis with code: Claude 4 Opus vs. Gemini 2.5 vs. OpenAI o3

The vibes with Opus are the best; it has a personality and is stupidly capable. But it's too pricey; it's best used with the Claude app, since the API cost will put a hole in your pocket. Gemini will always be your friend, with free access and the cheapest SOTA model.

Would love to know your experience with Claude 4 Opus and how you would compare it with o3 and Gemini 2.5 pro in coding and non-coding tasks.

🌐
Medium
medium.com › @cognidownunder › anthropic-claude-opus-4-1-the-definitive-guide-to-anthropics-most-advanced-ai-model-yet-bf1c6f0de736
Anthropic Claude Opus 4.1: The Definitive Guide to Anthropic’s Most Advanced AI Model Yet | by Cogni Down Under | Medium
August 6, 2025 - Let’s cut through the marketing speak. Claude Opus 4.1 hits 74.5% on SWE-bench Verified, up from 72.5% in Opus 4. That’s a 2-percentage-point improvement in the model’s ability to fix real-world software bugs.
🌐
Reddit
reddit.com › r/claudeai › meet claude opus 4.1
r/ClaudeAI on Reddit: Meet Claude Opus 4.1
August 5, 2025 -

Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning.

We plan to release substantially larger improvements to our models in the coming weeks.

Opus 4.1 is now available to paid Claude users and in Claude Code. It's also on our API, Amazon Bedrock, and Google Cloud's Vertex AI.

https://www.anthropic.com/news/claude-opus-4-1

🌐
Reddit
reddit.com › r/claudeai › claude opus and sonnet 4 vs gpt4.1 - first hand experience as a professional firmware engineer experimenting with vibe.
r/ClaudeAI on Reddit: Claude opus and sonnet 4 vs gpt4.1 - first hand experience as a professional firmware engineer experimenting with vibe.
May 31, 2025 -

So to preface this, I've been writing software and firmware for over a decade, my profession is specifically in reverse engineering, problem solving, pushing limits and hacking.

So far I've been using the following: GPT 4.1, GPT o4, Claude Sonnet 4 (gets distracted by irrelevant signals like incorrect comments in code, assumptions, etc.), Gemini 2.5 (not great at intuiting holes in a task), and Claude Opus 4 (I have been forced to retry the same prompt with other AIs because of how poorly it performs).

I would say this is the order of overall success in usage. All of them improve my work experience; they turn the work I'd give a junior or intern, or grind work with a simple concept but laborious implementation, into minutes or seconds for an acceptable implementation.

Now they all have the usual issues, but Opus unfortunately has been particularly bad at breaking things, getting distracted, hallucinating, jumping to incorrect conclusions, getting stuck in really long, stupid loops, not following my instructions, and generally forcing me to reattempt the same task with a different AI.

They are all guilty of changing things I didn't ask for while performing other tasks. They all can fail to understand intent without very specific, unambiguous instructions.

GPT 4.1 simply outshines the rest in overall coding performance. It spots complex errors and intuits meaning rather than just going by the letter. It's QUICK, really quick compared to the others. And it doesn't piss me off (I've never felt the need to use expletives until Claude 4).

🌐
Hacker News
news.ycombinator.com › item
Claude Opus 4.1 | Hacker News
August 8, 2025 - If you look at the past, whenever Google announces something major, OpenAI almost always releases something as well · People forget that OpenAI was started to compete with Google on AI
🌐
Reddit
reddit.com › r/singularity › claude opus 4.1 benchmarks
r/singularity on Reddit: Claude Opus 4.1 Benchmarks
June 21, 2025 - Windsurf reports Opus 4.1 delivers a one standard deviation improvement over Opus 4 on their junior developer benchmark, showing roughly the same performance leap as the jump from Sonnet 3.7 to Sonnet 4. My hope is that they're releasing this ...
🌐
9to5Mac
9to5mac.com › 2025 › 08 › 05 › anthropic-claude-opus-4-1
Anthropic rolls out Claude Opus 4.1 with improved software engineering accuracy - 9to5Mac
August 5, 2025 - Anthropic says Claude Opus 4.1 improves software engineering accuracy to 74.5%. That compares to 62.3% with Claude Sonnet 3.7 and 72.5% with Claude Opus 4.
🌐
Laozhang
blog.laozhang.ai › api-services › claude-opus-pricing-2025
Claude 4.1 Opus Pricing Guide 2025: Complete Cost Analysis & Comparison – LaoZhang-AI
The standout achievement of Claude Opus 4.1 lies in its unprecedented 74.5% score on SWE-bench Verified, establishing it as the industry leader for coding tasks and surpassing competitors like GPT-4.1’s 69.1% and Gemini 2.5 Pro’s 63.2%. This performance translates directly into practical ...
🌐
Reddit
reddit.com › r/claudecode › is claude code sonnet 4.5 really better than opus 4.1? not seeing it.
r/ClaudeCode on Reddit: Is Claude Code Sonnet 4.5 Really Better Than Opus 4.1? Not Seeing It.
October 3, 2025 -

How are people genuinely praising Claude Code Sonnet 4.5? I have no idea what’s happening…but from my experience it’s pretty disappointing. Sorry if that stings, but I’m honestly curious about what others see in it.

I'm speaking as someone who uses Claude Code daily, easily 7+ hours per day, and who has been deeply involved with it since the beginning. I consider myself a power user and truly understand the capabilities it should have. Maybe I'm missing something crucial here… but BESIDES that point, I'm really dissatisfied and frustrated with Anthropic right now.

On top of that, the marketing hype around Sonnet 4.5 feels like the same AI slop promotion we saw everywhere with ChatGPT lol. It's being marketed as the "best model in the world," likely to people who barely even scratch its surface.

I’ve also just hit a usage limit on Opus 4.1. I’m on the max 200 plan and now there’s some kind of cap in place…for what, a week? Why? If Sonnet is sooooo good why are they placing weekly limits on opus 4.1? So stupid. Can someone explain what’s going on here?

🌐
Anthropic
anthropic.com › news › claude-opus-4-5
Introducing Claude Opus 4.5
However, what if we: ... 1. Upgrade his cabin from basic economy to economy (or business) 2. Then modify the flights to be 2 days later. This would cost more money, but it's a legitimate path within the policy! ... The benchmark technically scored this as a failure because Claude's way of helping the customer was unanticipated. But this kind of creative problem solving is exactly what we've heard about from our testers and customers; it's what makes Claude Opus 4.5 feel like ...
🌐
Reddit
reddit.com › r/claudeai › 4.1 opus isn't perfect but the difference is enormous.
r/ClaudeAI on Reddit: 4.1 Opus isn't perfect but the difference is enormous.
August 12, 2025 -

I previously had the $100 Claude 4 but went back to $20. Today, I decided to try out 4.1 Opus. Unbelievable really.

I had previously attempted this enormous shitshow of a refactor from React Context to Zustand over 40k lines of code, and everything always failed miserably. I'm a 2.5 fanboy, but it doesn't have that capability.

I hit the limits of the $100 plan pretty fast, so I went to $200, and it's been a breeze. Really logical code changes and great testing along the way. It all makes sense for this huge refactor that I will spend the next few weeks working on.

Yeah, I'm a believer. I have bitched about Claude plenty but this just feels smart as hell.

For context, I am trying to maintain my current application's behaviour while switching to Zustand and react query. Nothing new yet, just wildly complex tech debt to navigate out of.

(10+ years programming and had a semi-successful saas before with all the business meetings etc. that goes along with that. Not a newbie.)