Videos
Source: Code with Claude Opening Keynote
After trying the free versions of several assistants (GitHub Copilot, ChatGPT, etc.), Claude Sonnet 4 Thinking 🙌 stands out for me as the best coding assistant so far. A few things that sold me:
Reasoning-first answers — it walks through why an approach works (or doesn’t) rather than just pasting code.
Multi-file context — it keeps track of project structure and gives consistent suggestions across files.
Refactor & tests — it suggests concise refactors and generates unit tests that actually catch edge-cases.
Debugging help — when I paste stack traces or failing tests it narrows the root cause quickly and suggests minimal fixes.
Readable style — produced code is readable and easy to adopt; less hand-holding required.
Not perfect — token limits and cost can be a factor for very large projects, and sometimes you still need to vet outputs. But for me the time saved + improved code quality outweighs those. Curious what others use for deep debugging or multi-file refactors.
Anyone else prefer Claude for coding? Why/why not?
Do you like this personally?
Starting off: Don't get me wrong, Sonnet 4 is a legendary model for coding. It's so good, maybe even too good. It has zero-shotted basically every one of my personal tests in Cursor and a couple of complex Rust problems I always test LLMs with.
I believe most people have hugely praised Sonnet 4 for good reasons. It's extremely good at coding, and since lots of people in this sub are coders, they often feel their whole day gets more productive. What they don't realize is that this model is kinda bad for normies. On a personal note, this model feels severely overtrained on code, which likely caused catastrophic forgetting. It feels severely lobotomized on non-code-related tasks.
Opus 4, however, seems to be fine; it has gone through my math tasks without any issues. Just too expensive to be a daily driver tho.
Here is one of the grade 9 math problems from math class that I recently had to do (yes, I'm in high school). I decided to try Sonnet 4 on it.
[Image: Math Problem]
I gave Sonnet 4 (non-reasoning) this exact prompt of "Teach me how to do this question step-by-step for High School Maths", and GPT-4.1 the same prompt with the same image attached.
Results:
Sonnet 4 Response:
Sonnet 4 got completely confused, started doing confusing random operations, and got lost. Then it gave me some vague steps and tried to get me to solve it???? Sonnet 4 very rarely gets it right; it either tries to make the user solve it or gives out answers like 3.10, 3.30, 3.40, etc.
GPT-4.1 Response:
I have rerun the same test on GPT-4.1 many times and it gets it right every single time. This is one of dozens of questions I have found Sonnet 4 getting consistently wrong or just rambling about, whereas GPT-4.1 nails it right away.
People in AI all believe these models are improving so much (they are), but normies don't experience that as much. I believe the most substantial recent improvements in these models have been in code; normies don't code, so they can tell it improved a bit, but not a mind-blowing amount.
I have a question regarding the thinking vs non-thinking versions for Sonnet 4. I've been using the 3.7 non-thinking version because it shows less over-eagerness and better rule-following. With Claude 4, since its over-eagerness is now tamed, does it make sense to use the thinking mode?
What is thinking mode supposed to help with? I'm using it with Cline and need clarity on whether thinking mode is worth it. If so, how many tokens should be allocated to thinking for optimal results?
I'm not too concerned with costs but prioritize better rule-following and problem-solving.
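For anyone wondering what "allocating tokens to thinking" actually controls: in the Anthropic Messages API, extended thinking is enabled per request with a budget_tokens cap that has to fit inside max_tokens, and Cline's thinking setting maps to the same knob. Below is a minimal sketch assuming the Python SDK; the model ID and budget value are placeholders for illustration, not recommendations:

```python
# Minimal sketch of setting an extended-thinking budget via the Anthropic
# Messages API. Model ID and budget value are illustrative assumptions.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

response = client.messages.create(
    model="claude-sonnet-4-20250514",  # assumed Sonnet 4 model ID
    max_tokens=16000,                  # must be larger than the thinking budget
    thinking={"type": "enabled", "budget_tokens": 8000},  # tokens reserved for reasoning
    messages=[
        {"role": "user", "content": "Refactor this function and explain the edge cases."}
    ],
)

# The reply interleaves "thinking" blocks with the final "text" blocks;
# print only the visible answer.
for block in response.content:
    if block.type == "text":
        print(block.text)
```

A larger budget gives the model more room to reason on hard problems but adds latency and cost, so it tends to get tuned per task rather than set to one universally optimal number.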