Opus follows instructions better than Sonnet for big tasks, but it can sometimes overthink or over-engineer something. Sonnet tends to be more direct but sometimes doesn't follow instructions as well as Opus. I'd say Opus is great at planning and executing big tasks, while Sonnet is best for simpler tasks that don't need the intelligence of Opus. Or at least, plan with Opus and code with Sonnet is another way. Answer from Hauven on reddit.com
🌐
Reddit
reddit.com › r/claudeai › opus 4 vs sonnet 4
r/ClaudeAI on Reddit: Opus 4 vs Sonnet 4
May 26, 2025 -

I work in quantitative finance, so most of my programming revolves around building financial tools that detect and exploit market anomalies. The coding I do is highly theoretical and often based on insights from academic finance research.

I’m currently exploring different models to help me reason through and validate my approaches. Does anyone have experience using Opus 4 or Sonnet 4 for this kind of work? I’m trying to figure out which is the best fit for my use case.

r/ClaudeAI on Reddit: Claude Opus vs 3.7 Sonnet for coding
March 17, 2025 -

Hey everyone, I've been using Claude 3.7 Sonnet for coding projects and now via Claude Code with a MAX subscription, but notice it still tends to over-engineer solutions and ignores explicit instructions to keep things simple (KISS, DRY, YAGNI, etc.) in my CLAUDE.md, prompts, and project instructions in Claude Desktop/Claude.ai.

I always forget Opus exists, and am wondering if anyone has any input on Opus vs. Sonnet 3.7 for coding and math?

Thanks for your suggestions!

Note: I’ve developed what I feel should be the perfect instructions and memory for Sonnet 3.7 to follow but it still needs to constantly be corrected and reminded.

r/ClaudeAI on Reddit: Which is better for coding? Sonnet or Opus?
July 31, 2024 - Claude is 100% better at coding. Combine that with the Claude Queue extension and you pretty much have a junior dev on standby ... I've been using Claude 3.5 Sonnet in a variety of contexts for code assistance.
r/ClaudeAI on Reddit: Is Opus significantly better than Sonnet for software development?
April 25, 2024 -

I've been playing with the free version, giving it some requirements and testing the code it produces. While it's pretty cool to see it understand the requirements and produce OK code, I have to go through a lot of iterations to get it to what I expect.

I wonder if Opus is significantly "smarter" when writing software based on vague-ish requirements. Please share your experience.

Top answer
1 of 8
40
Here’s my workflow: a custom ChatGPT "Claude Prompt Generator" that uses Anthropic’s prompt engineering documentation, uploaded as reference material, to craft prompts for Claude from my natural language. GPT generates XML-formatted, structured instructions and tasks for Claude to easily digest and provide optimal output.

Step 1: Flesh out an idea and ask Opus to create a detailed explanation of the task at hand and propose a potential workflow to build a solution.
Step 2: Feed Opus’ idea to my ChatGPT prompt generator and have it produce a prompt in XML format with code snippets as example outputs, roles (you are a senior software dev), and structured tasks and contexts. ChatGPT is surprisingly good at generating Claude XML if you give it the documentation.
Step 3: Get Sonnet to generate the initial solution and code with the ChatGPT-formatted prompt.
Step 4: Feed the Sonnet code back to my ChatGPT prompt generator to construct an XML prompt asking Claude to verify the code against the initial Sonnet prompt and review any errors, improvements, inaccuracies or other observations.
Step 5: Feed the validation prompt, the initial prompt, and the code into Opus. The XML-formatted GPT prompt is actually essential for making sure Opus understands what each file is and what to do with it.
Step 6: Use Opus to regenerate certain parts of the code, or to act on the improvements it has identified in Sonnet’s code, with a many-shot approach.
Step 7: If any issues are not making progress, just fix and touch them up myself.
Step 8: Verify the finished code with a non-custom GPT and Opus side by side, multiple times. You’ll know that the models can’t do much more for you when they both start suggesting the same minor improvements. They’ll usually suggest different improvements, which is good. I find that ChatGPT can sometimes spot things Opus can’t, but using that information I can instruct Opus to correct the problem, and it does so better than GPT.
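The XML-structured prompt from Steps 2 and 4 can also be assembled directly in Python. This is only a sketch: the tag names (`prompt`, `role`, `context`, `tasks`, `example_output`) and the `build_prompt` helper are illustrative assumptions, not a schema taken from Anthropic's documentation or from the workflow above.

```python
import xml.etree.ElementTree as ET


def build_prompt(role: str, context: str, tasks: list[str], example_output: str) -> str:
    """Assemble an XML-structured prompt. Tag names are hypothetical."""
    root = ET.Element("prompt")
    ET.SubElement(root, "role").text = role
    ET.SubElement(root, "context").text = context
    tasks_el = ET.SubElement(root, "tasks")
    for i, task in enumerate(tasks, start=1):
        # Number each task so a later review prompt can refer to it by id.
        ET.SubElement(tasks_el, "task", id=str(i)).text = task
    ET.SubElement(root, "example_output").text = example_output
    ET.indent(root)  # pretty-print; requires Python 3.9+
    return ET.tostring(root, encoding="unicode")


prompt = build_prompt(
    role="You are a senior software dev",
    context="Plan produced by Opus in Step 1 goes here.",
    tasks=["Implement the parser", "Write unit tests for edge cases"],
    example_output="def parse(line: str) -> dict: ...",
)
print(prompt)
```

Building the tree with `xml.etree.ElementTree` rather than string formatting means code snippets containing `<`, `>` or `&` are escaped automatically, so the prompt stays well-formed XML.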
In summary, GPT and Opus are a strong tag team for planning, small logical revisions and debugging, but you’re wasting tokens using Opus to generate code, and you’re wasting time using GPT to generate code. They also work very well together if you explain that you are using both of them to collaborate on a project; they seem to understand the pitfalls and areas to focus on when they understand the context of being paired with each other. For example, for GPT: “You generated this prompt for Claude, and Claude responded with this prompt.”

Sonnet is quite capable and fast, too. For less complex projects, even Haiku is very reliable. Opus acts as a project director and supervisor. GPT acts as a manager. Sonnet and Haiku act as the developers.

I don’t really care what benchmarks say, because the benchmarked GPT models are definitely not what you get with a GPT subscription or API key. Anthropic’s public models seem to be more aligned with their benchmarked models. Perhaps context window is key, or perhaps quality of training data surpasses quantity of training data, and perhaps the benchmarks we have currently are not as applicable for assisting developers who aren’t PhD AI researchers conducting benchmark tests.

Claude just has more energy. He’s like that guy who wants to help and puts his hand up to answer questions in class. GPT acts like I’m not paying it enough to be at work. Even if GPT was benchmarked significantly higher than Claude, you’re still going to get more done with the enthusiastic guy. I just wish these AI platforms would start adopting subscription models where you can pay exorbitant fees to avoid getting stuck on the same hardware as everybody else paying 20 dollars or using their API balance.

Finally: to review a completed code base, use Greptile. Not Cursor, not aids, or whatever else it’s called. Not Codeium. Currently, codebases will fuck with the quality of your output. Multiple files, specifically. It’s worth aggregating everything into one or two files and then modularising it manually later. Greptile is the only platform that can actually productively use an entire code base. I highly suggest using Greptile at all advanced stages in your project’s development, as Claude and GPT are not even close to Greptile’s ability to contextualise code. Greptile can help generate prompts with contextual reminders.
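The "aggregate everything into one or two files" advice can be scripted. A minimal sketch, assuming a Python project and a made-up `# ===== path =====` separator so the model can tell the files apart; `aggregate_sources` is a hypothetical helper, not part of any tool mentioned in this answer.

```python
import tempfile
from pathlib import Path


def aggregate_sources(root: Path, pattern: str = "*.py") -> str:
    """Concatenate matching files under `root`, each prefixed with its relative path."""
    files = [p for p in root.rglob(pattern) if p.is_file()]
    # Sort by relative path so the bundle is stable between runs.
    files.sort(key=lambda p: p.relative_to(root).as_posix())
    parts = []
    for path in files:
        rel = path.relative_to(root).as_posix()
        # The separator format is an arbitrary choice for this sketch.
        parts.append(f"# ===== {rel} =====\n{path.read_text(encoding='utf-8')}")
    return "\n\n".join(parts)


# Demo on a throwaway directory so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    demo_root = Path(tmp)
    (demo_root / "pkg").mkdir()
    (demo_root / "pkg" / "b.py").write_text("B = 2\n", encoding="utf-8")
    (demo_root / "a.py").write_text("A = 1\n", encoding="utf-8")
    bundle = aggregate_sources(demo_root)

print(bundle)
```

Keeping the relative path in each separator makes it easier to split the bundle back into modules by hand later.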
2 of 8
19
I've used both, and I felt like ChatGPT tries to BS me from time to time and doesn't correct the code based on my feedback. Opus just gets the job done much more smoothly. You should give it a try.
r/ClaudeCode on Reddit: Is Claude Code Sonnet 4.5 Really Better Than Opus 4.1? Not Seeing It.
October 3, 2025 -

How are people genuinely praising Claude Code Sonnet 4.5? I have no idea what’s happening…but from my experience it’s pretty disappointing. Sorry if that stings, but I’m honestly curious about what others see in it.

I’m speaking as someone who uses Claude Code daily, easily 7+ hours per day, and who has been deeply involved with it since the beginning. I consider myself a power user and truly understand the capabilities it should have. Maybe I’m missing something crucial here…but BESIDES that point I’m really dissatisfied and frustrated with Anthropic right now.

On top of that, the marketing hype around Sonnet 4.5 feels like the same garbage AI slop promotion we saw everywhere with ChatGPT lol. It’s being marketed as the “best model in the world,” likely to people who barely even scratch its surface.

I’ve also just hit a usage limit on Opus 4.1. I’m on the Max 200 plan and now there’s some kind of cap in place…for what, a week? Why? If Sonnet is sooooo good, why are they placing weekly limits on Opus 4.1? So stupid. Can someone explain what’s going on here?

r/ClaudeAI on Reddit: Opus vs Sonnet 3.5
July 8, 2024 -

I last had a Claude Opus subscription last May and did not renew afterwards. That was before Sonnet 3.5 was released. Right now I am using Sonnet 3.5 for coding, and surprisingly it is better than Opus was when I used it last May. Question: is it really better than Opus at coding, or did Opus also get upgraded like Sonnet? I am in a dilemma about whether to subscribe again or not.

r/ClaudeAI on Reddit: Claude code Sonnet vs Opus?
March 12, 2025 -

To all Max users: what is the difference in capability between Sonnet and Opus? Benchmarks alone aren’t really saying that Opus is better than Sonnet, but I wonder what it feels like in Claude Code. Is Opus better with complex tasks, and has it been able to do things that Sonnet wasn’t capable of? If so, what criteria make you choose Sonnet or Opus? Thank you.

r/ClaudeCode on Reddit: Opus 4.1 vs Sonnet 4.5
October 29, 2025 -

Curious to know what others' experience is using these models. I feel like even with the Max plan, I am forced to use Sonnet 4.5 - but holy fuck it's stupid compared to Opus 4.1. It's a fucking moron, a cute and funny one, but its IQ can't be above 70. Nevertheless, at least it's a great little coder when you tell it what to do and test its results comprehensively.

Do you use Opus or Sonnet, and why? Any tips/tricks that make Sonnet smarter?

r/singularity on Reddit: Aider coding benchmarks for Claude 4 Sonnet & Opus
May 26, 2025 - "We are reserving Claude 4 Sonnet...for things that are quite significant leaps, which are coming soon" - Dario Amodei - Anthropic CEO ... Claude Opus 4.5 beats every major model on SWE bench and ARC-AGI. The capability jump is bigger than it looks.
r/ClaudeAI on Reddit: Sonnet 4.5 vs Opus 4.1
September 29, 2025 -

I've been using Claude Opus 4.1 exclusively in my terminal setup for coding, reasoning, and agent-like tasks. It's been solid for complex workflows. But now that Sonnet 4.5 is out, I'm wondering if I should switch. From benchmarks, it seems to match or beat Opus in areas like coding (higher scores on SWE-Bench and agentic tasks), visual reasoning, and handling nuanced instructions, with better efficiency for iterative sessions. If you've tried both in a CLI/terminal environment, what's your take? Does Sonnet hold up for deep reasoning and long-chain planning, or does Opus still edge it out there?

For complex workflows, would you recommend switching? Experiences appreciated!

r/ClaudeCode on Reddit: Sonnet's fine, but Opus is the one that actually understands a big codebase
October 17, 2025 -

I love Claude Code, but I've hit a ceiling. I'm on the Max 20 plan ($200/month) and I keep burning through my weekly Opus allowance in a single day, even when I'm careful. If you're doing real work in a large repo, that's not workable.

For context: I've been a SWE for 15+ years and work on complex financial codebases. Claude is part of my day now and I only use it for coding.

Sonnet 4.5 has better benchmark scores, but on large codebases seen in the industry it performs poorly. Opus is the only model that can actually reason about large, interconnected codebases.

I've spent a couple dozen hours optimising my prompts to manage context and keep Opus usage to a minimum. I've built a library of Sonnet prompts & sub-agents which:

  • Search through and synthesise information from tickets

  • Locate related documentation

  • Perform web searches

  • Search the codebase for files, patterns & conventions

  • Analyse code & extract intent

All of the above is performed by Sonnet. Opus only comes in to synthesise the work into an implementation plan. The actual implementation is performed by Sonnet to keep Opus usage to a minimum.
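The split described above (Sonnet for research and implementation, Opus only for the plan) can be written down as a routing table. This is a hypothetical sketch: the stage names paraphrase the list, `sonnet` and `opus` are plain labels rather than real model identifiers, and nothing here calls Claude Code or any API.

```python
from dataclasses import dataclass

# Hypothetical tier labels, not real model identifiers.
CHEAP, EXPENSIVE = "sonnet", "opus"

# Stage names paraphrase the post; the mapping itself is an assumption
# about how one might encode that workflow.
STAGE_TO_TIER = {
    "search_tickets": CHEAP,
    "locate_docs": CHEAP,
    "web_search": CHEAP,
    "search_codebase": CHEAP,
    "analyse_code": CHEAP,
    "plan": EXPENSIVE,
    "implement": CHEAP,
}


@dataclass
class Step:
    stage: str
    tier: str


def build_pipeline(stages: list[str]) -> list[Step]:
    """Attach a tier to each stage, failing loudly on unknown stage names."""
    unknown = [s for s in stages if s not in STAGE_TO_TIER]
    if unknown:
        raise ValueError(f"unknown stages: {unknown}")
    return [Step(s, STAGE_TO_TIER[s]) for s in stages]


def expensive_share(pipeline: list[Step]) -> float:
    """Fraction of steps routed to the expensive tier."""
    if not pipeline:
        return 0.0
    return sum(step.tier == EXPENSIVE for step in pipeline) / len(pipeline)


pipeline = build_pipeline(list(STAGE_TO_TIER))
for step in pipeline:
    print(f"{step.stage:>16} -> {step.tier}")
print(f"expensive share: {expensive_share(pipeline):.0%}")
```

Counting steps is only a rough proxy; actual usage depends on tokens consumed per step, which this sketch does not model.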

Yet even with this minimal use I hit my weekly Opus limits after a normal workday. That's with me working on a single codebase with a single Claude Code session (nothing in parallel).

I'm not spamming prompts or asking it to build games from scratch. I've done the hard work to optimise for efficiency, yet the model that actually understands my work is barely usable.

If CC is meant for professional developers, there needs to be a way to use Opus at scale. Either higher Opus limits on the Max 20 plan or an Opus-heavy plan.

Anyone else hitting this wall? How are you managing your Opus usage?

(FYI I'm not selling or offering anything. If you want the prompts I spoke about they're free on this github repo with 6k stars. I have no affiliation with them)

TLDR: Despite only using Opus for research & planning, I hit the weekly limits in one day. Anthropic needs to increase the limits or offer an Opus-heavy plan.

r/ClaudeAI on Reddit: Claude 3.5 Sonnet v. Claude 3 Opus - who's better at creative writing?
June 23, 2024 -

Has anyone really experimented with these two with regards to creative writing? And if so, which one do you think is better? Sonnet 3.5 has a lot of impressive capabilities for sure but I wonder if its writing quality is really on par with Opus 3. Thoughts?

Top answer
1 of 13
32
Well, Sonnet is THEORETICALLY better than perhaps any other LLM, but its refusal to do anything graphic, even when it does graphic things all on its own without my say-so, makes me want to just use GPTo for creative stuff, despite it clearly being worse. Like, it's GREAT. It's amazing, but the censorship is bad. I don't know if paying money for it makes that better, but it certainly hasn't shown that on that basis alone.
2 of 13
19
Gemini 1.5 Pro is the best model I've seen so far for creative writing, immediately followed by Opus. But it depends: Opus needs some prompting to avoid clichés and grandiose prose, otherwise everything will be a magnificent crescendo of love woven into the fabric of existence. If you remove that, and strike the right balance, it's fantastic and can nail a large variety of styles. If you provide examples and cooperate to write together, it's even better. You just need to be patient and edit the prompt until you succeed. Sonnet 3.5 nails realistic prose much quicker, without embellishments. I wouldn't call it particularly creative though, unless you want irony/humor (basically every model I know above 70B is good at humor with the correct prompt). If you have access to Opus, compare the three for your use case (you can use Gemini Pro for free here, just remember to select the right model. You can also remove some safety filters: https://ai.google.dev/aistudio ). Also keep in mind that it depends a lot on your specific style and prompts. Sometimes Gemini failed me miserably, and Sonnet didn't.
r/ClaudeAI on Reddit: Are latest Sonnet and Opus models better than aistudio.google.com ?
May 30, 2025 -

I've been using Cursor with Sonnet 3.5 for coding for months, but over the last month Cursor's overall performance has degraded badly. Then 2 weeks ago I came across aistudio.google.com with the Gemini Pro 2.5 05-06 experimental model, which is currently entirely free on the web UI, and its performance greatly impressed me, so I have used it exclusively since then.

But now I wonder whether the $100 or $200 Max tiers of Claude would serve me better for the same coding tasks.

Can anybody compare?