Horrible so far tonight. Seems like it's not actually reading what I type, making tons of assumptions, and generally being lazy about things like reading files/logs. Answer from Independent-Space643 on reddit.com
CodeRabbit
coderabbit.ai › blog › claude-sonnet-45-better-performance-but-a-paradox
Claude Sonnet 4.5: Better performance but a paradox
We benchmarked Claude Sonnet 4.5 as it narrows the gap with Opus 4.1 in code review, catching more critical bugs with 41% important hits at lower cost.
Videos
18:56
Claude Sonnet 4.5 Is INSANE (Full Review + Real Examples) - YouTube
33:17
Sonnet 4.5 is the best coding model in the world - YouTube
16:08
Claude Sonnet 4.5 AI Vibe Coding is INSANE (Sonnet 4.5 Code Fast ...
11:19
Why I DELETED Claude Sonnet 4.5 from ALL MY PROJECTS - YouTube
05:05
Claude Haiku 4.5 Is Here… And It’s BETTER Than Sonnet 4.5?! ...
06:11
Claude Sonnet 4.5 VS Claude Sonnet 4 - Which Is The BEST Model ...
Who should use Claude Sonnet 4.5?
Claude Sonnet 4.5 is ideal for developers, writers, researchers, and professionals needing deep reasoning, detailed analysis, and long-context understanding. It’s best for complex problem-solving, code analysis, research synthesis, and creative writing.
humai.blog
humai.blog › claude-sonnet-4-5-review-is-this-really-the-smartest-ai-model-now
Claude Sonnet 4.5 Review: Is This Really the Smartest AI Model Now?
What are the limitations of Claude Sonnet 4.5?
Claude Sonnet 4.5 cannot generate images, has limited real-time web access, and can be overly cautious on sensitive topics. It may also make factual errors on niche or recent information and cannot execute code directly.
humai.blog
humai.blog › claude-sonnet-4-5-review-is-this-really-the-smartest-ai-model-now
Claude Sonnet 4.5 Review: Is This Really the Smartest AI Model Now?
Is Claude Sonnet 4.5 worth the cost?
Yes. For $20 per month through Claude Pro, users get access to Sonnet 4.5 along with Opus and Haiku. The value is strong for professionals, writers, developers, and researchers who use AI regularly. Occasional users can try the free tier before upgrading.
humai.blog
humai.blog › claude-sonnet-4-5-review-is-this-really-the-smartest-ai-model-now
Claude Sonnet 4.5 Review: Is This Really the Smartest AI Model Now?
Reddit
reddit.com › r/claudeai › what are your first impressions of sonnet 4.5?
r/ClaudeAI on Reddit: What are your first impressions of Sonnet 4.5?
September 30, 2025 -
Curious about any specific areas where it seems to excel or struggle compared to Opus 4.1 & Sonnet 4.
Top answer 1 of 39
12
Horrible so far tonight. Seems like it's not actually reading what I type, making tons of assumptions, and generally being lazy about things like reading files/logs.
2 of 39
9
Just used it to tweak my CV (resume) and it completely rewrote it having made up a random masters degree I'd never heard of and added a lot of random technical skills I don't actually have. The prompt explicitly told it to use an existing CV and a job description I provided in the context window and to make it completely factual and non subjective. When I asked it to change the US spelling to British it failed quite miserably so I ended up giving up half way through and doing it myself in Google docs. Also tried it's hardest to connect to my GMAIL using a MS connector and then broke down in a sobbing heap when I corrected it, claiming to not have any clue about Claude connectors vs MCP servers. No way I'm letting it loose in any of my compute environments just yet. Shame, I was quite excited to read about the release this morning and toying with the idea about re-upgrading my account again. I think I'll wait.
Anthropic
anthropic.com › news › claude-sonnet-4-5
Introducing Claude Sonnet 4.5
Experts in finance, law, medicine, and STEM found Sonnet 4.5 shows dramatically better domain-specific knowledge and reasoning compared to older models, including Opus 4.1. ... We're seeing state-of-the-art coding performance from Claude Sonnet 4.5, with significant improvements on longer horizon ...
Every
every.to › vibe-check › vibe-check-claude-sonnet-4-5
Vibe Check: Claude Sonnet 4.5
September 29, 2025 - Anthropic just rolled out Claude Sonnet 4.5, and, of course, we spent the weekend using it to code and running long agentic tasks with it. The headline: It’s noticeably faster, more steerable, and more reliable than · Opus 4.1—especially inside Claude Code. In head-to-head tests it blitzed through a large pull request review in minutes, handled multi-file reasoning without wandering, and stayed terse when we asked it to.
Humai
humai.blog › claude-sonnet-4-5-review-is-this-really-the-smartest-ai-model-now
Claude Sonnet 4.5 Review: Is This Really the Smartest AI Model Now?
November 4, 2025 - When Anthropic calls Sonnet 4.5 their "smartest model," they're making a specific claim about reasoning capability, context understanding, and output quality. Let me break down what this actually means in practice: The most noticeable improvement is in complex reasoning tasks. When you ask Claude Sonnet 4.5 to analyze something multifaceted—a business problem with competing priorities, a philosophical question with multiple perspectives, a technical challenge requiring several steps—the depth of analysis is noticeably better than previous models.
DataCamp
datacamp.com › blog › claude-sonnet-4-5
Claude Sonnet 4.5: Tests, Features, Access, Benchmarks, and More | DataCamp
September 30, 2025 - I get the impression that with this release, Anthropic is targeting enterprise customers, too. With an emphasis on coding for long stretches autonomously and better handling of science and finance tasks, there is a strong push for Claude Sonnet 4.5 to be the go-to model for complex coding tasks.
Binary Verse AI
binaryverseai.com › home › ai models & platforms › claude sonnet 4.5 review: everything you need to know
Claude Sonnet 4.5 Review: Definitive Benchmarks, Best Price
GPT-5 Codex often digs deeper, ... changes, GPT-5 Codex may still edge it in places. If you need quick, correct increments, Claude Sonnet 4.5 is a strong daily driver....
Published September 30, 2025
Anthropic
anthropic.com › news › claude-opus-4-5
Introducing Claude Opus 4.5
Claude Opus 4.5 excels at long-horizon, autonomous tasks, especially those that require sustained reasoning and multi-step execution. In our evaluations it handled complex workflows with fewer dead-ends. On Terminal Bench it delivered a 15% improvement over Sonnet 4.5, a meaningful gain that ...
Anthropic
anthropic.com › claude › sonnet
Claude Sonnet 4.5
Claude Sonnet 4.5's edit capabilities are exceptional — we went from 9% error rate on Sonnet 4 to 0% on our internal code editing benchmark. Higher tool success at lower cost is a major leap for agentic coding.
Skywork
skywork.ai › home › claude 4.5 vs claude 3.5 (2025): should you upgrade now?
Claude 4.5 vs Claude 3.5 (2025): Upgrade Decision & Comparison
September 30, 2025 - If you already rely on Claude 3.5 for coding, research, or agentic workflows, the real question is whether 4.5 tangibly reduces rework and failures enough to justify an immediate upgrade. Below is a pragmatic, scenario-led comparison to help you decide. ... Pricing parity: Anthropic reiterated that Sonnet 4.5 keeps Sonnet-tier pricing at $3 per million input tokens and $15 per million output tokens in 2025 (see the Anthropic 4.5 announcement, 2025).
Hacker News
news.ycombinator.com › item
Claude Sonnet 4.5 | Hacker News
August 22, 2025 - It's very good - I think probably a tiny bit better than GPT-5-Codex, based on vibes more than a comprehensive comparison (there are plenty of benchmarks out there that attempt to be more methodical than vibes) · It particularly shines when you try it on https://claude.ai/ using its brand ...
LessWrong
lesswrong.com › posts › 4yn8B8p2YiouxLABy › claude-sonnet-4-5-system-card-and-alignment
Claude Sonnet 4.5: System Card and Alignment
Sonnet 4 failed their rubric in many of these areas 20%-40% of the time, which seems unacceptably high, whereas with Sonnet 4.5 most areas are now below 5% failure rates, with especially notable improvement on biological and deadly weapons. It’s always interesting to see which concerns get tested, in particular here ‘romance scams.’ · For Claude Sonnet 4.5, our multi-turn testing covered the following risk areas: