sonnet 4.5 vs gemini 3 pro

[DISCUSSION] Is Gemini 3.0 really better than Claude Sonnet 4.5/Composer for coding?

reddit.com › r › cursor › comments › 1p0gr0s › discussion_is_gemini_30_really_better_than_claude

UI-wise it's better than anything else out there by miles, based on my testing. There's no competition when it comes to frontend. The benchmarks show Claude is a bit better on SWE-bench, which means there are some cases where Claude is the better candidate for your code. Answer from yaboyyoungairvent on reddit.com

r/cursor on Reddit: [DISCUSSION] Is Gemini 3.0 really better than Claude Sonnet 4.5/Composer for coding?

November 18, 2025

I've been switching back and forth between Claude Sonnet 4.5 (or Composer 1) and Gemini 3.0, and I’m trying to figure out which model actually performs better for real-world coding tasks inside Cursor. I'm not looking for a general comparison.

I want feedback specifically in the context of how these models behave inside the Cursor IDE.

Videos

YouTube (06:59): They did not see this coming… | Gemini 3 Pro vs Claude Sonnet ... (November 24, 2025)

YouTube (24:39): Gemini 3 vs Claude Sonnet 4.5 - which model actually codes ... (November 21, 2025)

YouTube (42:07): Ultimate Battle! Claude 4.5 Sonnet vs Gemini 3 Pro | Who's Best Coder? (November 19, 2025)

YouTube (45:23): I Tested Claude 4.5 Sonnet vs Gemini 3 Pro for Coding - The Winner ... (November 21, 2025)

reddit.com: r/cursor on Reddit: Gemini 3 is disappoint GPT 5.1 and sonnet 4.5 ... (November 22, 2025)

reddit.com › r/bard › it seems opus 4.5 is just too amazing even compared to gemini 3

r/Bard on Reddit: It seems opus 4.5 is just too amazing even compared to gemini 3

November 24, 2025 - I was testing Gemini 3 Pro and Sonnet 4.5 side by side yesterday, and to my shock, Sonnet 4.5 is a lot better at instruction following and creativity, and doesn't hallucinate as much.

Getpassionfruit

getpassionfruit.com › blog › gpt-5-1-vs-claude-4-5-sonnet-vs-gemini-3-pro-vs-deepseek-v3-2-the-definitive-2025-ai-model-comparison

GPT 5.1 vs Claude 4.5 vs Gemini 3: 2025 AI Comparison

1 month ago - Gemini 3 Pro leads overall reasoning benchmarks with an unprecedented 1501 LMArena Elo, becoming the first model to break the 1500 barrier, while Claude 4.5 Sonnet dominates real-world coding at 77.2% SWE-bench and DeepSeek-V3.2 delivers ...

Medium

medium.com › ai-software-engineer › claude-opus-4-5-is-here-and-beats-gemini-3-pro-swe-by-4-7-i-tested-it-e3887df3ed04

Claude Opus 4.5 Is Here (And Beats Gemini 3 Pro SWE) — I Tested It | by Joe Njenga | AI Software Engineer | Nov, 2025 | Medium

November 26, 2025 - Anthropic just released the Claude Opus 4.5 model, and it built this app in 2 minutes and just made Gemini 3 Pro look weaker.

Clarifai

clarifai.com › home › gemini 3.0 vs gpt-5.1 vs claude 4.5 vs grok 4.1: ai model comparison

Gemini 3.0 vs GPT-5.1 vs Claude 4.5 vs Grok 4.1: AI Model Comparison

2 weeks ago - By late 2025, a new generation of large‑language models (LLMs) has appeared that pushes the boundaries of reasoning, context memory and emotional intelligence. Google’s Gemini 3.0 Pro, OpenAI’s GPT‑5.1, Anthropic’s Claude Sonnet 4.5 and xAI’s Grok 4.1 represent the cutting edge.

AceCloud

acecloud.ai › blog › claude-opus-4-5-vs-gemini-3-pro-vs-sonnet-4-5

Claude Opus 4.5 Vs Gemini 3 Pro Vs Sonnet 4.5 Comparison Guide

November 25, 2025 - Pick Gemini 3 Pro if you need very strong multimodal performance, a 1M-token context window by default, and tight integration with Google tools and Search. Pick Claude Opus 4.5 if you care most about frontier coding performance, deep reasoning ...

x.com › sdrzn › status › 1990886120654344300

Gemini 3 Pro is the best of Claude Sonnet 4.5 (coding, agentic thinking) and Gemini 2.5 Pro (actually handles 1m context well). It felt like model improvements got linear seeing how the jump from Sonnet 3.7 → 4 and GPT 4.1 → 5 felt, but ...

Data Studios

datastudios.org › post › google-gemini-3-vs-claude-sonnet-4-5-full-report-and-comparison-of-features-capabilities-pricing

Google Gemini 3 vs. Claude Sonnet 4.5: Full Report and Comparison of Features, Capabilities, Pricing, and more

November 22, 2025 - Both models push the boundaries of what AI can do, but they come with different strengths and design philosophies. This report provides a comprehensive comparison across key dimensions: from raw reasoning prowess and coding skills to multimodal ...

Glbgpt

glbgpt.com › hub › gemini-3-pro-vs-claude45

Gemini 3 Pro vs Claude 4.5: I Tested Both for Coding – Here’s the Surprising Winner

November 20, 2025 - In other words, Gemini 3 Pro feels like a very powerful but sometimes unpredictable senior engineer: brilliant at certain tasks, but you have to supervise it closely. Claude 4.5 (especially the Sonnet variant) has built a reputation as one of ...

Jduncan

jduncan.io › blog › 2025-11-20-google-antigravity-gemini-3-first-impressions

Gemini 3 Pro vs Claude Sonnet 4.5: Antigravity IDE Review

November 20, 2025 - TechRadar ran a comparison where Gemini 3 Pro built a working Progressive Web App with keyboard controls without being asked. Claude struggled with the same prompt. The benchmark data backs this up. Gemini 3 Pro scored 2,439 on LiveCodeBench Pro compared to Claude Sonnet 4.5’s 1,418.

AssemblyAI

assemblyai.com › blog › gemini-3-pro-vs-gpt-5-vs-claude-4-5

Gemini 3 Pro vs GPT-5 vs Claude 4.5: Which model wins for audio workflows?

November 20, 2025 - Gemini 3 Pro brings smarter summaries and actionable insights to audio workflows. Compare it to GPT-5, Claude 4.5, and other leading LLMs.

Simtheory

simtheory.ai › models › compare › gemini-3-pro-preview › vs › claude-4.5-sonnet

Gemini 3 Pro vs Claude 4.5 Sonnet: AI Model Comparison | Simtheory

November 18, 2025 - Gemini 3 Pro: Google's most advanced AI model combining breakthrough reasoning depth, native multimodal understanding, and state-of-the-art agentic capabilities to help you learn, build, and plan anything · The latest Claude 4.5 Sonnet model ...

The Algorithmic Bridge

thealgorithmicbridge.com › p › google-gemini-3-just-killed-every

Google Gemini 3 Is the Best Model Ever. One Score Stands Out Above the Rest

November 18, 2025 - Gemini 3 Pro earned ~$5.5k on Vending-Bench 2, the vending machine benchmark (it tries to answer a valuable real-world question: Can AI models run a profitable business across long horizons?), compared to ~$3.8k from Sonnet 4.5.

TechRadar

techradar.com › ai platforms & assistants

I tested Gemini 3, ChatGPT 5.1, and Claude Sonnet 4.5 – and Gemini crushed it in a real coding task | TechRadar

November 18, 2025 - Also: Make the ring look a bit ... · Gemini 3 Pro generated a Version 2.0 with, among other updates, "CSS Perspective to tilt the [ring] floor," and drama in the form of "the whole camera shakes when a heavy hit lands."...

reddit.com › r/claudeai › claude code-sonnet 4.5 >>>>>>> gemini 3.0 pro - antigravity

r/ClaudeAI on Reddit: Claude Code-Sonnet 4.5 >>>>>>> Gemini 3.0 Pro - Antigravity

November 22, 2025

Well, without rehashing the whole Claude vs. Codex drama again, we’re basically in the same situation except this time, somehow, the Claude Code + Sonnet 4.5 combo actually shows real strength.

I asked something I thought would be super easy and straightforward for Gemini 3.0 Pro.
I work in a fully dockerized environment, meaning every little Python module I have runs inside its own container, and they all share the same database. Nothing too complicated, right?

It was late at night, I was tired, and I asked Gemini 3.0 Pro to apply a small patch to one of the containers, redeploy it for me, and test the endpoint.
Well… bad idea. It completely messed up the DB container (no worries: it didn’t delete the volumes, and I had backups anyway). It spun up a brand-new container, created a new database, and set a new password, “postgres123”. Then it kept starting and stopping the module I had asked it to refactor… and since it had changed the database, of course the module couldn’t connect anymore. Long story short: even with precise instructions, it failed, ran out of tokens, and hit the 5-hour limit.

So I reverted everything and asked Claude Code the exact same thing.
Five to ten minutes later: everything was smooth. No issues at all.
The refactor worked perfectly.

Conclusion:
Maybe everyone already knows this, but even the best benchmarks, including agentic ones, are NOT good indicators of real-world performance. This all comes down to orchestration, and that’s exactly why so many companies like Factory.AI are investing heavily in this space.