What do people think of Google's Gemini (Pro?) compared to Claude for code? I re...

CuriouslyC · 2025-08-23T21:10:36 1755983436

Gemini is amazing for taking a merge file of your whole repo, dropping it in there, and chatting about stuff. The level of whole codebase understanding is unreal, and it can do some amazing architectural planning assistance. Claude is nowhere near able to do that.

My tactic is to work with Gemini to build a dense summary of the project and create a high level plan of action, then take that to gpt5 and have it try to improve the plan, and convert it to a hyper detailed workflow xml document laying out all the steps to implement the plan, which I then hand to claude.

This avoids pretty much all of Claude's unplanned bumbling.

seanwessmith · 2025-08-24T02:20:26 1756002026

mind typing this up? i've got a basic GPT -> Claude workflow going for now

CuriouslyC · 2025-08-24T02:36:20 1756002980

https://gist.github.com/githubcustomerserviceistrash/c716e76...

I should mention I made that one for my research/stats workflow, so there's some specific stuff in there for that, but you can prompt chat gpt to generalize it.

threecheese · 2025-08-24T18:51:41 1756061501

I mean, damn. Are terms like “executable oracles” and “hermetic boots” related to your domain, or are you using these as terms of art for an agent? Oracle being a source of truth, hermetic meaning no external dependencies or side effects - definitions in furtherance of your request for concise language. Would love to understand more.

CuriouslyC · 2025-08-25T03:49:31 1756093771

This prompt is for scientific research. In general my goal is to instruct the agent to build as much validation scaffolding as possible, so rather than holding its hand I can just give it a series of concrete hurdles and tell it not to come back until they're met. I don't want it finishing the basic tasks and coming back to me saying the app is "production ready," I want to come back after a few hours to the agent having "proven a spec" with a demo or a paper that I can iterate on.

koakuma-chan · 2025-08-23T20:12:12 1755979932

I don't think Gemini Pro is necessarily worse at coding, but in my experience Claude is substantially better at "terminal" tasks (i.e. working with the model through a CLI in the terminal) and most of the CLIs use Claude, see https://www.tbench.ai/leaderboard.

jsight · 2025-08-23T20:06:35 1755979595

For the web ui (chat)? I actually really like gemini 2.5 pro.

For the command line tool (claude code vs gemini code)? It isn't even close. Gemini code was useless. Claude code was mostly just slow.

lifthrasiir · 2025-08-24T08:38:46 1756024726

Yeah, the main strength of gemini-cli is being open-sourced and it still needs much polishing. I ended up building my own web-based interactive agent based on gemini-cli [1] out of frustration.

[1] https://github.com/lifthrasiir/angel

upcoming-sesame · 2025-08-23T22:55:19 1755989719

You mean Gemini CLI. Yeah it's confusing

jsight · 2025-08-23T23:55:50 1755993350

Thanks, that's the one!

Herring · 2025-08-23T22:39:01 1755988741

Yeah I was also getting much better results on the Gemini web ui compared to the Gemini terminal. Haven't gotten to Claude yet.

jonfw · 2025-08-23T20:22:47 1755980567

Gemini is better at helping to debug difficult problems that require following multiple function calls.

I think Claude is much more predictable and follows instructions better- the todo list it manages seems very helpful in this respect.

esafak · 2025-08-24T03:14:29 1756005269

I used to like it a lot but I feel like it got dumber lately. Am I imagining things or has anyone else observed this too?

divan · 2025-08-23T20:53:58 1755982438

In my recent tests I found it quite smart at analyzing bigger picture (i.e. "hey, test failing not because of that, but because of whole assumption has changed and let me rewrite this test from scratch". But it also got stuck few times "I can't edit file, I'm stuck, let me try completely differently". But the biggest difference so far is the communication style - it's a bit.. snarky? I.e. comments like "yeah, tests are failing - as I suspected". Why the f it suspected failing test on the project it sees for the first time? :D

poniko · 2025-08-24T12:28:45 1756038525

Pretty much every time Claude code is stuck or more or less just coding in circles i use Gemini PRO to analyze the code/data and feed the response into Claude to solve it. I also have much more success with Gemini when creating big sql transforming scripts or similar. Both are quite bad on bigger tasks, they get you 60% and then i spend days and days to trying to get to 100% .. its such a time sink when i select the wrong task for the llm.

Keyframe · 2025-08-23T20:09:28 1755979768

It's doing rather well at thinking, but not at coding. When it codes, often enough it runs in circles and ignores input. Where I find it useful is to read through larger codebases and distill what I need to find out from it. Even using gemini from claude to consult it for certain things. Opus is also like that btw, but a bit better at coding. Sonnet though, excels at coding.. from my experience though.

donperignon · 2025-08-24T06:03:17 1756015397

Personally gemini has been giving me better results. Claude keeps trying to generate react code even when the whole context and my command is svelte, and failing constantly to give me something that can at least run, gemini, on the other hand has been pretty good with styling, and useful with the bussines logic. I dont get all the hype around claude.

gedy · 2025-08-24T15:07:41 1756048061

Claude Code is just a nicer dev experience, especially with simpler stuff. I've seen Gemini do much better with Svelte as well.

yomismoaqui · 2025-08-23T19:56:38 1755978998

According to the guys from Amp Claude Sonnet/Opus are better at tool use.

ezfe · 2025-08-23T19:53:43 1755978823

Gemini frequently didn't write code for me for no explicable reason, and just talked about a hypothetical solution. Seems like a tooling issue though.

djmips · 2025-08-23T19:56:15 1755978975

Sounds almost human!

brabel · 2025-08-24T08:21:43 1756023703

LLMs are built on human content and they do behave similarly to humans sometimes, including both the good and the bad.

nicce · 2025-08-23T20:33:19 1755981199

If you could control the model with system command, it would be very good. But at last I have failed miserably. Model is too verbose and helpful.

stabbles · 2025-08-23T20:08:35 1755979715

In my experience it's better at lower level stuff, like systems programming. A pass afterwards with claude makes the code more readable.

filchermcurr · 2025-08-23T21:27:15 1755984435

The Gemini CLI tool is atrocious. It might work sometimes for analyzing code, but for modifying files, never. The inevitable conclusion of every session I've ever tried has been an infinite loop. Sometimes it's an infinite loop of self-deprecation, sometimes just repeating itself to failure, usually repeating the same tool failure until it catches it as an infinite loop. Tool usage frequently (we're talking 90% of the time) fails. It's also, frankly, just a bummer to talk to. The "personality" is depressed, self-deprecating, and just overall really weird.

That's been my experience, anyway. Maybe it hates me? I sure hate it.

klipklop · 2025-08-24T00:49:26 1755996566

This matches my experience with it. I won’t let it touch any code I have not yet safely checked in before firing up Gemini. It will commonly get into a death loop mid session that can’t be recovered from.

polotics · 2025-09-02T08:06:01 1756800361

this is so weird I am not at all getting the same experience, its tools work, it changes typescript and python confidently, makes mistakes, understands them and fixes them. I had a case of it giving up and admitting failure, but not in the way you describe

esafak · 2025-08-24T14:24:00 1756045440

Once it repeatedly printed shame in all caps. I was worried until I figured out it was talking to itself.

KaoruAoiShiho · 2025-08-23T19:53:29 1755978809

It sucks.

KaoruAoiShiho · 2025-08-23T22:17:29 1755987449

Lol downvoted, come on anyone who has used gemini and claude code knows there's no comparison... gimme a break.

bitpush · 2025-08-23T22:27:12 1755988032

You're getting down voted because of the curt "it sucks" which shows a level of shallowness in your understanding.

Nothing in the world is simply outright garbage. Even the seemingly worst products exist for a reason and is used for a variety of use cases.

So, take a step back and reevaluate whether your reply could have been better. Because, it simply "just sucks"

polotics · 2025-09-02T08:07:31 1756800451

can you detail the differences you see that substantiate your judgement?