Videos
Claude Code-Sonnet 4.5 >>>>>>> Gemini 3.0 Pro - Antigravity
Claude Sonnet 4.5 best AI for code generation, thoughts on worthy rivals?
Introducing Claude Haiku 4.5: our latest small model.
All this talk about Claude Sonnet 3.5 being good...
Well, without rehashing the whole Claude vs. Codex drama again, we’re basically in the same situation except this time, somehow, the Claude Code + Sonnet 4.5 combo actually shows real strength.
I asked something I thought would be super easy and straightforward for Gemini 3.0 Pro.
I work in a fully dockerized environment, meaning every little Python module I have runs inside its own container, and they all share the same database. Nothing too complicated, right?
It was late at night, I was tired, and I asked Gemini 3.0 Pro to apply a small patch to one of the containers, redeploy it for me, and test the endpoint.
Well… bad idea. It completely messed up the DB container (no worries, I had backups even though it didn’t delete the volumes). It spun up a brand-new container, created a new database, and set a new password “postgres123”. Then it kept starting and stopping the module I had asked it to refactor… and since it changed the database, of course the module couldn’t connect anymore. Long story short: even with precise instructions, it failed, ran out of tokens, and hit the 5-hour limit.
So I reverted everything and asked Claude Code the exact same thing.
Five to ten minutes later: everything was smooth. No issues at all.
The refactor worked perfectly.
Conclusion:
Maybe everyone already knows this, but the best benchmarks even agentic ones are NOT good indicators of real-world performance. This all comes down to orchestration, and that’s exactly why so many companies like Factory.AI are investing heavily in this space.
We have just plugged in Claude Sonnet 4.5 into our WordPress plugin solution and it is such a dream to use. We prefer plugging in the mid-end models that are not token-cost hungry and after extensive testing using this model to create for example a complex booking forms with calendar, it seems to do the best job out of our current list of AI service providers especially when it comes to refactoring code, adding more functions/features (e.g. modifying the form), theme and look and feel mods, and general optimisations. OpenAI with GPT-5 Mini and Nano is also pretty good, Grok on the other hand is not quite there with code generation yet.
What are your thoughts on Claude Sonnet 4.5?
Also, we wish to add another few AI providers, any recommendations?
Five months ago, Claude Sonnet 4 was state-of-the-art. Today, Haiku 4.5 matches its coding performance at one-third the cost and more than twice the speed.
Haiku 4.5 surpasses Sonnet 4 on computer use tasks, making Claude for Chrome even faster.
In Claude Code, it makes multi-agent projects and rapid prototyping markedly more responsive.
Sonnet 4.5 remains the best coding model in the world. Haiku 4.5 gives you near-frontier performance with greater cost-efficiency.
Use them together: Sonnet can build multi-step plans, then orchestrate a team of Haikus to complete subtasks in parallel.
Devs can use Claude Haiku 4.5 on our API, Amazon Bedrock, and Google Cloud’s Vertex AI.
It's a drop-in replacement for both Haiku 3.5 and Sonnet 4 and is available to all users today.
Read more: https://www.anthropic.com/news/claude-haiku-4-5
I swear Claude has an army of bots posting how much better it is than OpenAI.
I use both, all day every day for programming, switching back and forth. Sometimes one can help me get to the next step while the other can't. Sometimes it takes both.
But, in no way, IMHO, is Claude Sonnet 3.5 vastly better than OpenAI GPT 4o.
"Speechless", "The difference is insane", and so on... What the hell?
It's more like "yeah, it's ok", or "it's comparable".
Am I being trolled? Is everyone here a bot? Anyone else notice this or do you think I'm out to lunch?!?