opencode vs codex reddit - Brave Search

Any difference when using GPT model inside Codex vs OpenCode?

reddit.com › r › opencodeCLI › comments › 1r6marc › any_difference_when_using_gpt_model_inside_codex

I think both claude code and codex have some magic sauce to work better with their respective models. I personally think codex + 5.3 codex is way ahead of opencode + 5.3 codex. I'm realising now the harness matters just as much as the model these days. Answer from itsjase on reddit.com

reddit.com › r/opencodecli › any difference when using gpt model inside codex vs opencode?

r/opencodeCLI on Reddit: Any difference when using GPT model inside Codex vs OpenCode?

February 16, 2026 -

I'm a die-hard fan of OpenCode - because of free model, how easy it is to use subagents, and just because it's nice. But I wonder if anyone finds GPT models better in Codex? I cannot imagine why they could possibly work better there, but maybe models are just trained that way, so they "know" the tools etc? Anyone noticed anything like that?

I think both claude code and codex have some magic sauce to work better with their respective models. I personally think codex + 5.3 codex is way ahead of opencode + 5.3 codex. I'm realising now the harness matters just as much as the model these days.

This is what people mean when they say "the harness". Using it in codex means you get the bare bones experience. Not bad but it just means it isn't fine tuned to work specifically for coding /what you would want for coding at least. There's a concern for over engineering but that's why things are open source. Claudecode for example has a lot of changes to its system prompt to work well for everything Claude code will try to do like call mini agents and tools during it's execution. Codex afaik actually has nothing but I could be wrong. GitHub copilot has its own too supposedly tuned for multi model workflows, and opencode has their own as well. Imo all models work better in opencode. Sometimes this changes with select models, but it's a safe bet just to use opencode.

reddit.com › r/opencodecli › opencode vs codex cli: same prompt, clearer output — why?

r/opencodeCLI on Reddit: Opencode vs Codex CLI: Same Prompt, Clearer Output — Why?

January 16, 2026 -

Hi everyone! I came across Opencode and decided to try it out—was curious. I chose Codex (I have a subscription). I was genuinely surprised by how easy it was to communicate in planning mode with gpt-5.2-low: discussing tasks, planning, and clarifying details felt much smoother. Before, using the extension or the CLI was pretty tough—the conversation felt “dry.” But now it feels like I’m chatting with Claude or Gemini. I entered the exact same command—the answers are essentially the same, but Opencode explains it much more clearly. Could someone tell me what the secret is?

edit#1:
Second day testing the opencode + gpt-5.2-medium setup, and the difference is huge. With Codex CLI and the extension, it was hard for me to properly discuss tasks and plan — the conversation felt dry. Here, I can spend the whole day calmly talking things through and breaking plans down step by step. It genuinely feels like working with Opus, and sometimes even better. I’m using it specifically for planning and discussion, not for writing code. I don’t fully understand how opencode achieves this effect — it doesn’t seem like something you can get just by tweaking rules. With Codex CLI, it felt like talking to a robot; now it feels like talking to a genuinely understanding person.

Coding agent clients (sometimes referred to as harnesses) have their own prompt engineering flows. those include the system prompt, compacting strategies, context lookups, code indexing and more. Behind the scenes you plugged them to the same model with the same user prompt, but the model received a whole lot of different prompt that just happens to include a part of it that is identical

Videos

Plan with Claude and hand off to Codex to implement in a ...

r/opencodeCLI on Reddit: Why do you guys use opencode?

r/LLMDevs on Reddit: Intent - Work with Claude Code, Codex, OpenCode ...

February 11, 2026

I Tested Claude Code, OpenCode, and Gemini - Codex Still Wins - ...

February 24, 2026

Opencode Vs Claude Code Vs Codex | Which Code Generation Tool Is ...

February 4, 2026

OpenCode Review: Because I Hit Claude Code Usage Limits (Claude ...

January 11, 2026

reddit.com › r/claudecode › claude code vs codex vs opencode, which one is actually worth using?

r/ClaudeCode on Reddit: Claude Code vs Codex vs OpenCode, which one is actually worth using?

1 week ago -

From what I’ve seen so far, Claude Code seems to have the best overall reviews in terms of quality and performance. The main downside for me is that it’s locked behind a company and not open source (I know about the leak, but I’m more interested in something officially open and actively maintained).

Codex, on the other hand, looks really appealing because it’s open source and allows for forks, which gives it a lot more flexibility and long-term potential.

Then there’s OpenCode, probably the most interesting of the three. It has a huge community and a lot of momentum, but I’m not sure if it’s actually on par with the others in real-world use.

Curious to hear your thoughts, how do these compare in practice? Is OpenCode actually competitive, or is it more hype than substance?

Oh and by Claude i'm referring to the open sourced forks that are comming, which we don't know if will be updated or etc, not using the proprietary one ever

Codex for sure. The agentic capability is better or equal to CC You won’t run out of limit after 5 questions lol

they're all good at different things and none is 'best' across the board. claude is still the strongest for complex reasoning and multi-file refactors. codex is fast and cheap for straightforward tasks. the real answer is you probably need more than one, and the question becomes how you split work between them without manually switching every time.

reddit.com › r/codex › how does the codex app compare to opencode?

r/codex on Reddit: How does the Codex app compare to Opencode?

February 4, 2026 -

Was wondering of peoples opinions on using the Codex app from people who have used opencode in the past. I basically exclusively use OpenAI models for my workflow through opencode and wondering what makes it worth the switch to the codex app

It will be very interesting to see the responses for.

I don't think there is a correct answer here, because it all depends on your workflow. I've never used the codex app, but I recently switched from claudecode to opencode, mainly because of its more advanced permission system for subagents and the ability for subagents to spawn subagents. Also, my workflow is similar in concept to spec-driven, so it feels more natural in opencode, where the internal tools and instructions don't try to do things “their way” as in claudecode, which is becoming more and more opinionated. In general, I think that if these advantages of opencode are not important to you, then as a minimum, you are unlikely to lose anything by trying the codex app, and you may even gain something, who knows.

reddit.com › r/codex › opencode cli vs codex cli

r/codex on Reddit: Opencode cli vs Codex cli

January 19, 2026 -

Recently opencoded added chatgpt support (not just api key, subscription models work too). Anybody used that? How does opencode cli perform against codex cli?

I haven’t tried opencode yet but people on here have said codex cli pure is better. Will be testing it out this week though.

It’s a good tool and can be configured to use multiple models. OTOH it’s using opentui which breaks some features of my terminal. What do you want to know with "perform"? Any test case you have in mind?

reddit.com › r/opencodecli › using codex gpt-5.3 (high) in opencode better than just in terminal (inside vsc)?

r/opencodeCLI on Reddit: Using Codex GPT-5.3 (high) in opencode better than just in terminal (inside VSC)?

February 14, 2026 -

Hi,

What are the advantages of using Codex GPT-5.3 (high) inside opencode then using Codex in a traditional way in terminal. It's for inside VSC and mostly projects that revolves around php, js and/or Laravel projects to give you guys a bit more context.

Talking about context, does working with Codex inside change anything with the context window?

I know the biggest advantage for opencode is that you can switch models but apart from that I'm wondering what opencode can do more in terms of advantages than just using codex in terminal in VSC (instead of opencode inside terminal in VSC).

Thank you all!

PS: I'm not a native English speaker and didn't use AI to rewrite my text so hopefully it was understandable :)

Codex is pretty barebones. If you notice, it does use sed commands pretty often to read files, and codex does not have a lsp for the language of your choice. Opencode is just a better software that can really make gpt 5.3 shine.

How did you manage to have Codex 5.3 in OpenCode? Mine lists the model but when I try to use it I get an error that the model is not available.

reddit.com › r/codex › codex in opencode

r/codex on Reddit: Codex in OpenCode

December 30, 2025 -

Fellow Codex users, anyone using codex in OpenCode or https://github.com/code-yeongyu/oh-my-opencode? I want to know what the general consensus is on this, whether it’s advised or if you think just using Codex cli is possibly better. Im seeing lots of hype with OpenCode so want to hear people’s thoughts and if they’ve tried it. (Also if you use codex with it does it charge to your api key or you can use your weekly codex limit from chatgpt plan?) Thanks.

Codex cli works fine. Not going to overthink it.

Opencode is cool and yes normally it works with API but they allow Claude Max subscriptions natively and codex subscription via plugin (afaik), I use the API key so you may need to ask someone else. When comparing opencode with another tool, it's better to compare it with CC than codex, as a TUI. Because opencode is so ahead of codex in feature parity and trades blows with CC in QOL features. I use it as my daily driver and it has nearly all the features I want and if not, there are plugins. The automatic LSP and formatter integrations will save you a lot of compilation errors because the agent automatically gets the errors fed back to it when it makes syntax errors or similar, which is very valuable. Automatic formatting will make your code cleaner, the codex team have instructions in their AGENTS.md for how to run just fmt after every step to ensure formatting etc, this comes automatically in opencode, something that even CC doesn't have. They're now adding native LSP tools like symbol searching, renaming etc so soon the agent will not be globing and rg-ing its way around

reddit.com › r/codex › codex raw or app or opencode?

r/codex on Reddit: Codex raw or app or opencode?

February 14, 2026 -

Like the title suggested which one gives better performance from yall experience?

lmao... codex "raw"?

“Raw” and app are the same; the app is just a very nice UI on top of the “raw”/CLI. Opencode is a different harness with a different system prompt and tool selection. OpenAI models are trained on V4 diff apply_patch tool that’s ingrained in their system, and they can do crazy stuff with it like multi-edit 20 files in one go surgically. It’s so good at 0 cost (the agent doesn’t need to learn it, it’s in its training) that nearly all harnesses like opencode and roo now use it instead of their own custom tools. Codex is also trained natively on a todo tool, but they specify that just using the same tool API contract is enough and it won’t matter for performance. These are things you need to know if you want to use the model effectively and maximize its performance. Also, Codex CLI can use the experimental features like subagents while the app won’t. I don’t know why. The app, on the other hand, has one of the best worktree implementations in the app, and Whisper is shipped with it, which makes it very nice to use. That’s another point.

Find elsewhere

Google Bing Mojeek

reddit.com › r/codex › codex cli vs gpt-5.2 codex on opencode — which do you prefer?

r/codex on Reddit: Codex CLI vs GPT-5.2 Codex on OpenCode — which do you prefer?

February 2, 2026 -

Hey everyone,

I’ve been seeing a lot of people moving from Claude code to Codex, so I’m thinking about giving it a try.

For those who’ve used it:

Do you prefer Codex CLI or GPT-5.2 Codex on OpenCode?

What’s the best way to use Codex day-to-day (workflow, setup, tips)?

Thanks!

I tried this. From personal experience, emphatically can say just use Codex CLI :) It is built for it. And more importantly it's rapidly improving.

Opencode sucks when it comes to long convos, compact is not using the true power of remote compact (which is excellent). Protocol is half implemented. Vanilla opencode has the wrong context size configured for the models. I put in a patch to fix all these issues that got rejected - the devs don't really want to concentrate on chatgpt support. Avoid. The latest codex CLI is far far better, especially with collab model, multi-agent, background tasks etc.

reddit.com › r/opencodecli › codex plus or opencode go ?

r/opencodeCLI on Reddit: codex plus or opencode go ?

1 month ago -

Hey :),

ist codex plus or opencode go a better deal for my money ? I don't won't to spent more than 20 $ for my cli ai agent. What's the best deal for my money ? I don't want to vibe code I use ai only for questions debugging and simple task :)

Thanks and Best regards :)

I compared best value for money a lot yesterday after getting too fed up with antigravity nerfing usage limits to unuseably small. It looks like github copilot pro+ plan (40€) is best value for money, and you can use it via api on opencode. They also have 10€ pro plan, but i dont think its enough for many. Also the 10€ pro plan has 1 month free trial, so you can test it and see if you need pro+ or not. If you dont vibe around al lday, the 10€ pro plan likely is enough. And you do get unlimited bit smaller models use with it as well € = $ pretty much.

Codex will take you a long way, especially now with the 2x limits. OpenCode Go is only $10, so it's my way of supporting them while gaining access to Kimi and GLM 5, which are both fine models. Codex is great value for money at $20. Go is also excellent for half that price; I've never hit rate limits with them, though Kimi was failing from time to time.

reddit.com › r/codex › thank you openai for letting us use opencode with the same limits as codex

r/codex on Reddit: thank you OpenAI for letting us use opencode with the same limits as codex

February 25, 2026 -

switched to ChatGPT Pro not too long ago and i genuinely love codex - simple tool, does what it needs to do, no fluff

but opencode is on another level as a harness. subagents, grep tools, proper file navigation - it's a much more serious setup for real engineering work

and the fact that you're letting us use it freely with the same limits as codex is huge. props for not gatekeeping it unlike, well, you know who

appreciate it OpenAI, this is how you treat your users

Claude/Google dont allow this Thanks OpenAI

And if you export this env variable , you can still get the 2x usage for using the desktop app promo that goes through April 2nd. export CODEX_INTERNAL_ORIGINATOR_OVERRIDE="Codex Desktop"

reddit.com › r/opencodecli › why should i use my openai subscription with open code instead of plain codex?

r/opencodeCLI on Reddit: Why should I use my OpenAI subscription with Open Code instead of plain codex?

January 25, 2026 -

I’m really interested in the project since I love open source, but I’m not sure what are the pros of using OpenCode.

I love using Codex with the VSC extension and I’m not sure if i can have the same dev experience with Open Code.

The best thing about opencode is you can build workflows that utilize different LLM's. If you JUST have an openai subscription then you can have an orchestrator running a small gpt model, when it needs to create an SDD and dependency map it could use the xhigh gpt5, then kick off that result to build an implementation plan to codex, and then code it up with codex. You would get almost the exact same results as using GPT5 xhigh, but at a much reduced cost. Enough of a reduced cost that you could then have a dialectical agent come in behind top level steps and validate results. The downside is that there is a lot of learning. If you use opencode I'd suggest also grabing some popular plugins to check out what is possibe such as oh my opencode. OMO is kind of heavy, so its debatable on if it's a good daily driver BUT it absolutely is a good example of things that can be done in the system.

I’m using codex with open code ChatGPT plus subscription and never going back.

reddit.com › r/codex › gpt-5.3-codex + opencode is almost claude code + opus 4.6 level

r/codex on Reddit: GPT-5.3-codex + OpenCode is almost Claude Code + Opus 4.6 level

February 16, 2026 -

Opus 4.6 + Claude Code is insane, it 1 shots complicated changes across the code bases I work on professionally.

Locally, I was using the codex cli, but the results were always meh. Recently moved to use my ChatGPT Plus Subscription with OpenCode to use 5.3-codex, and the harness is soooo much better than the codex CLI, Mac App, or VS Code extension.

Results are for certain always higher quality, it feels like the OpenCode is able to somehow provide much better context.

The one thing I haven't been able to figure out is - How can I set the reasoning level for 5.3-codex via OpenCode.

Very strange that opencode would nerf Codex 5.3 down to Opus 4.6's level, but you don't have to use it - just stick with Codex CLI and you'll get the better performance you expect.

My findings are the opposite. Love OpenCode (and really appreciate OpenAI rubber-stamping their sub use within it) but I find Codex a bit weaker within it. Point it to an Office document - and you get a generic "can't read binary" refusal; whereas within Codex CLI it 'knows' to use scripting to read it. Tool calling is a tad more robust in Codex CLI as well. Codex CLI also supports vision (ie, paste in/reference an image and it can 'see' it); OpenCode requires an MCP for this which is usually inferior. OpenCode also segment-faults a lot under Windows (due to issues with the underlying Bun library); whereas Codex (save for the missing-text bug) runs pretty stable. (I've swapped back to WSL for OC). Still, OC is my daily driver because these weaknesses are outshined by the ability to swap models from different providers mid-chat as necessary (!), the awesome text formatting/theming, the mouse support, session searching and access to the menu even with a prompt typed. Hit CTRL+T to adjust reasoning level.

reddit.com › r/opencodecli › benefit of oc over codex 5.3

r/opencodeCLI on Reddit: Benefit of OC over codex 5.3

February 24, 2026 -

Hi all. Can anyone tell me the benefit of using codex via oauth in opencode CLI over just using codex CLI?

At the moment my workflow is to chat through my ideas with ChatGPT. Formulate a plan and then hand that off to Codex with guardrails. Codex makes the changes to my codebase, produces a diff and a summary which ChatGPT checks and if we’re happy, I commit and push. All in a Linux VM using codex in VScode IDE.

So, what would OC bring to the table!?

So far I’ve made an off-market property sourcing app using python to make API calls to enrich a duckdb database, surface it in streamlit and pump out communications and business information material. It’s all been mega new to me. I can’t code and hadn’t even touched AI never mind heard of python before sep 24 which is why I need to source lots and lots of advice using a chatbot before committing to a certain direction.

This is just the beginning for me and I read non-stop on the subject. It’s all incredibly exciting and I’m obsessed with the possibilities for this app and beyond.

So you're wondering what OpenCode brings to the table versus just using Codex CLI directly, right? The main thing is choice. Codex CLI locks you into OpenAI models only, but OpenCode gives you access to tons of providers, models and even local models via Ollama. You can see how this matters when you want to experiment without hitting usage limits or when you just want cheaper options for simple tasks. Personally, I like the sub agent system it has. I can easily define some sub agents, from different model and it would nicely hand that off. It's also free and open-source. For some of the providers, you bring your own API keys and only pay for what you use, versus needing a ChatGPT Plus/Pro subscription. For your Python learning journey, this means you can test different models to see which explains concepts best for your style. The terminal UX is nicer too. You get LSP support for better code completion, instant model switching with hotkeys, and a responsive UI built by people who actually care about terminals. Plus OpenCode stores zero code or context data, which matters if you're handling sensitive property data. That said, Codex CLI is faster (and simpler) and has built-in review commands that OpenCode lacks. If you're happy with your current ChatGPT + Codex workflow, you might not need to switch. But if you want flexibility without subscription lock-in, OpenCode is probably worth a look. They say, don't fix whats not broken. PS: I use codex with opencode frequently.

Multi model agentic approach. I have codex creating code and Kimi doing a review. After feature implementation is complete I also do a further review with security in focus with some free model from opencode zen. I also found codex in OC better equipped with tools calling.

reddit.com › r/opencodecli › opencode vs cc

r/opencodeCLI on Reddit: Opencode vs CC

January 3, 2026 -

I’m trying to figure out what the differences between opencode and cc are when it come to actual output, not the features they have per se and how we can make the most use of these features depending on usecases.

I had a recent task to investigate an idea I had and create an MVP for it. So starting with a clean slate I gave the same prompt in opencode using Claude sonnet 4.7 and also GLM4.7. And in Claude it was sonnet 4.5.

The output from Claude code was way more general and it came back with questions slightly relaxant but not directly part of the main prompt. Clarifying them gave a broader scope to the task.

Opencode on the other hand, directly provided suggestions for implementation with existing libraries and tools. This was the same/similar output for both the models.

I’m interested to know what workflows others have and how they choose the best tool for the job. Or if you have any special prompts that you use would love to heard from you.

I recently converted all of my CC skills, agents, slash commands to OpenCode. I have not found many major differences in performance - but I like to imagine that is because I have such a tight development loop using the skills, agents and slash commands. I would say I do miss the interactive questioning that CC recently offered, but I am sure that is coming at some point in the future. I also more recently tested out SpecKit on OpenCode with some success as well - I just feel like the tighter the development loop and approach, the more predictable it's going to be regardless of CLI choice. I have now deleted my Claude Code files and am all in on Open Code. Also - I strongly dislike GLM 4.7, another test from the holiday - it simply writes bad code, everytime.

I’ve found that almost any tool I use, I can get good code out of it, but it does take some work to get things setup. Since my earliest trials of agentic tools for coding, I focused on the process and how I give the agents clear guide rails. I started with GitHub Copilot, as that’s what I initially had access to at work. I got myself a Copilot plan for my personal projects and used the VSCode insiders build so I could play with subagents. I setup a whole team of subagents to research, plan, implement, review and then one “Conductor” agent to orchestrate them all. I immediately found better outcomes no matter which model I threw at it. Mostly stuck to Opus for planning/orchestrating and Sonnet for the rest. I then got access to Claude Code and ported my same workflow over there. It has better support for subagents, which is nice, and since I was mostly using Claude models, it fit the workload well. The main issue I found it solved over Copilot was that the subagents in Copilot don’t respect the model setting in the agent files, so it only ever uses the same model for subagents as your primary orchestrating agent. I didn’t care for that. Last week, I decided to give OpenCode a try, as I’ve been hearing good things and had some time off. I rigged up OpenCode with my Claude plans, OpenRouter API, and also got the Z.ai coding plan on sale and added it to OpenCode as well. I ported over my Orchestration pattern and subagents to OpenCode and it worked quite well. It actually respects the model setting in the subagent files, and I really like the granular control of tools and commands. I initially tested with my standard collection of Claude models (Opus for planning and review, Sonnet for implementation) and it worked flawlessly. I then decided to try GLM-4.7 for implementation. GLM-4.7 isn’t as good at implementation, but it does still get the job done. I suspect that’s because of how strict my subagent files are with instructions about strict TDD and following the plan Opus made. I then have Opus review the code and then do a code review myself. With this pattern, I’d say I have about a 95% success rate in getting good code out of almost any model I throw at my issue. It is slow and methodical, but as the saying goes, “slow is smooth, smooth is fast”. I rarely have to revisit a feature or bug fix. Part of the planning process my Conductor does involves invoking subagents to research the code and hit MCP servers to search documentation, both context7 and web fetching. I do this in a dedicated research subagent as those MCP servers can end up eating 25-50% of a context window. By having the researcher do that and then just return to the conductor with a plan for what needs to be done, I can keep my conductor context clean and concise. I have the conductor make a multiphase plan for each feature or fix I need. I then have it create the plan with the researcher subagent, and present it to me. I review the plan, agree, and it writes it to a markdown file in a plans directory. I then have it start implementing using the implement subagent, review the code with the review subagent, and finally present the completed code for the phase to me. It pauses at the end of the phase, I review, make the commit, and tell it to move on to the next phase. Repeat this until it’s all done. With this, I’ve been able to break down and complete even complex tasks with ease. It has worked with almost any model I’ve thrown it at, but the better models do tend to get the right answer faster. I’ve open sourced my Copilot setup, but I plan to do the same with my OpenCode setup soon too.

reddit.com › r/ollama › which is better for coding: claude code, codex, opencode, or openclaw? and which cloud-based ollama model works best with the strongest of these coding tools?

r/ollama on Reddit: Which is better for coding: Claude Code, Codex, OpenCode, or OpenClaw? And which cloud-based Ollama model works best with the strongest of these coding tools?

February 27, 2026 -

Which is better for coding: Claude Code, Codex, OpenCode, or OpenClaw?

And which cloud-based open source Ollama model works best with the strongest of these coding tools?

OpenClaw is not a coding tool.

I've been using glm 5 under ollama cloud inside opencode and openclaw with good results so far

reddit.com › r/openai › question about codex vs opencode (github copilot) context limits (with gpt-5)

r/OpenAI on Reddit: Question about Codex vs Opencode (github copilot) context limits (with GPT-5)

August 27, 2025 -

I’ve been testing the context window differences between Codex (GPT-5) and OpenCode with GitHub Copilot (GPT-5), and the gap looks surprisingly big.

I gave both the exact same prompt, asking them to read all the .md files in my workspace to load as much context as needed. These were the results:

Codex (GPT-5): after using 48,122 tokens it still reported 85% of context free, which means a total context window of around 400k tokens.

OpenCode with Copilot (GPT-5): after using 92.5k tokens it reported 72% already used, which works out to a total context window of about 128k tokens.

So if these numbers are correct, Codex has roughly 400k tokens of context while OpenCode with Copilot is limited to about 128k.

My question is: is this difference real, or am I misunderstanding how these tools report context usage? Has anyone else run into the same thing?

Unfortunately this is intended design, GH Copilot and as you mentioned some other tools as well limit, the context window of GPT 5 internally (api sided) so there is no way to change this. I believe this is to prevent long context being repeatedly shoved in to save them input token cost. (I know that their cache token hit rate is cheaper but nonetheless still a price), the only way to get around it in most of these tools is by using your own api key which then will let you configure it personally. The more limited context does make the model summarize and compress tokens more often which can sometimes degrade performance and long-horizon task performance. If you already have codex I would recommend using this for those longer tasks which could require a larger context and the 128k contexts for small-medium.

reddit.com › r/codex › i want the reasons why you use codex. currently trying it out and moving away from claude code

r/codex on Reddit: I want the reasons why you use Codex. Currently trying it out and moving away from Claude Code

February 4, 2026 -

Like the title says, I've been a Claude Code user and a fan of it, but after hearing the founder of OpenClaw say that he used Codex and he preferred it more, I decided to try it myself and I was pleasantly surprised at the experience just wondering if there are other reasons that you all like Codex over other AI coding tools since im still new to Codex. Any personal favorite features you have much appreciated <3

Much better limits on $20 plan

Higher quality coding ability, pretty much

reddit.com › r/opencodecli › benchmarking with opencode (opus,codex,gemini flash & oh-my-opencode)

r/opencodeCLI on Reddit: Benchmarking with Opencode (Opus,Codex,Gemini Flash & Oh-My-Opencode)

January 24, 2026 -

A few weeks ago my "Private-Reddit-Alter-Ego" started and participated in some discussions about subagents, prompts and harnesses. In particular, there was a discussion about the famous "oh-my-opencode" plugin and its value. Furthermore I discussed with a few people about optimizing and shortening some system prompts - especially for the codex model.

Someone told me - if I wanted to complain about oh-my-opencode, I shall go and write a better harness. Indeed I started back in summer with an idea, but never finished the prototype. I got a bit of sparetime so I got it running and still testing it. BTW: My idea was to have controlled and steerable subagents instead of fire-and-forget-style-text-based subagents.

I am a big fan of benchmarking and quantitative analysis. To clarify results I wrote a small project which uses the opencode API to benchmark different agents and prompts. And a small testbed script which allows you to run the same benchmark over and over again to get comparable results. The testdata is also included in the project. It's two projects, artificial code generated by Gemini and a set of tasks to solve. Pretty easy, but I wanted to measure efficiency and not the ability of an agent to solve a task. Tests are included to allow self-verification as definition of done.

Every model in the benchmark had solved all tasks from the small benchmark "Chimera" (even Devstral 2 Small - not listed). But the amount of tokens needed for these agentic tasks was a big surprise for me. The table shows the results for the bigger "Phoenix-Benchmark". The Top-Scorer used up 180k context and 4M tokens in total (incl cache) and best result was about 100k ctx and 800k total.

Some observations from my runs:

- oh-my-opencode: Doesn't spawn subagents, but seems generous (...) with tokens based on its prompt design. Context usage was the highest in the benchmark.

- DCP Plugin: Brings value to Opus and Gemini Flash – lowers context and cache usage as expected. However, for Opus it increases computed tokens, which could drain your token budget or increase costs on API.

- codex prompt: The new codex prompt is remarkably efficient. DCP reduces quality here – expected, since the Responses API already seems to optimize in the background.

- coded modded: The optimized codex prompt with subagent-encouragement performed worse than the new original codex prompt.

- subagents in general: Using task-tool and subagents don't seem to make a big difference in context usage. Delegation seems a bit overhyped these days tbh.

Even my own Subagent-Plugin (will publish later) doesn't really make a very big difference in context usage. The numbers of my runs still show that the lead agent needs to do significant work to get its subs controlled and coordinated. But - and this is not really finished yet - it might get useful for integrating locally running models as intelligent worker nodes or increasing quality by working with explicit finegrained plans. E.g. I made really good progress with Devstral 2 Small controlled by Gemini Flash or Opus.

That's it for now. Unfortunately I need to get back into business next week and I wanted to publish a few projects so that they don't pile up on my desk. In case anyone likes to do some benchmarking or efficiency analysis, here's the repository: https://github.com/DasDigitaleMomentum/opencode-agent-evaluator

Have Fun! Comments, PRs are welcome.

EDIT: Here you find a Opencode-Only implementation of my subagent framework: https://www.reddit.com/r/opencodeCLI/comments/1reu076/controlled_subagents_for_implementation_using/

Subagents are not about saving context in general, but about avoiding context bloat. When each agent and subagent strictly performs its specific task and each of them has only the information they need in their context, the quality of generation is much higher.

I never really see any usefulness in a tool like Oh-my-opencode. I don’t know why, but to me it just seems like it consumes a lot more tokens for very little improvement.

reddit.com › r/codex › opencode with gpt is next level

OpenCode with GPT is next level : r/codex

January 16, 2026 - I don't want to switch away from codex because this is the fastest and direct way to get new improvements from the team · also anthropic cut them off recently openai and can easily do the same ... Cool story OP, but you're comparing apples to oranges. How's OpenCode with GPT vs Codex with GPT?