Can someone explain to me the recent assumed downfall of Claude

reddit.com › r › Anthropic › comments › 1na3rip › can_someone_explain_to_me_the_recent_assumed

I loved Claude just as much as everyone. It has become as usable as gpt 3.5. Anyone who says it's a prompting issue or chalking it up some user psychology is crazy. It's become absolutely unusable. I'm used to nerfed models, but I've never had to take a step back to be like 'oh, this is fundamentally broken.' Answer from whatjackfound on reddit.com

reddit.com › r › ClaudeAI

ClaudeAI

January 23, 2023 - Reddit user agreement: https://www.redditinc.com/policies/user-agreement-september-25-2023 ... Promoting your project or paid service is possible here but if your site looks spammy/sketchy to us, we will ban it. Fully disclosing what the user is getting, how and who it helps, and what your association is with the project will lower (but not eliminate) the likelihood of banning. Your posting privileges vary with your helpful participation on the sub. ... Claude has vast amounts of diverse use cases.

reddit.com › r/anthropic › can someone explain to me the recent assumed downfall of claude

r/Anthropic on Reddit: Can someone explain to me the recent assumed downfall of Claude

September 6, 2025 -

I took a 2 week break from AI stuff and loved Claude going into, and come back and see tons switching to codex or cursor or what have you. Can someone explain to me the rundown of events of what has happened?

Top answer

1 of 61

2 of 61

It had a REALLY rough week to end August. It wasn't just haters or bots (I am not either of those) and I saw the regression. All that said, it is doing better so far this week (in my experience), so I stick with it. Codex is useful in spots but CC in general is working well for me again, if not still a notch below where it was say a month ago.

Videos

reddit.com

r/ClaudeCode on Reddit: How I Start Every Claude Code Project

2 days ago

reddit.com

r/ClaudeAI on Reddit: Mind-Blowing Experience with Claude Computer Use

October 23, 2024

reddit.com

r/ClaudeAI on Reddit: I'm blown away by Claude Code - built a full ...

May 29, 2025

reddit.com

r/ClaudeAI on Reddit: Let Claude modify a draw.io diagram, but ...

4 days ago

reddit.com

r/ClaudeAI on Reddit: Found an open-source tool (Claude-Mem) that ...

5 days ago

reddit.com

r/ClaudeAI on Reddit: Claude Island — Dynamic Island for Claude Code

1 week ago

View all

reddit.com › r/anthropic › why is claude always the best ai for coding？

r/Anthropic on Reddit: Why is Claude always the best AI for coding？

August 14, 2025 -

Even though other companies have consistently surpassed Claude in programming rankings, Claude has always been the best

Top answer

1 of 30

That's because Anthropics is focusing on the coding abilities of their models first and foremost. Other models are surpassing them in the synthetic benchmarks from time to time. But they don't care much about things like code migration strategies or keeping things backwards compatible.

2 of 30

tool usage. they have the magic sauce the others have yet to replicate. it makes a huge difference.

reddit.com › r › ClaudeCode

r/ClaudeCode

February 24, 2025 - a community where claude code enthusiasts build, share, and solve together.

reddit.com › r/chatgptcoding › claude is so good at coding its crazy!

r/ChatGPTCoding on Reddit: CLAUDE IS SO GOOD AT CODING ITS CRAZY!

June 4, 2025 -

I have been using Gemini 2.5 pro preview 05-06 and using the free credits because imma brokie and I have been having problems at coding that now matter what I do I can't solve and gets stuck so I ask Gemini to give me the problem of the summary paste it to Claude sonnet 4 chat and BOOM! it solves it in 1 go! And this happened already 3 times with no fail it's just makes me wish I can afford Claude but will just have to make do what I can afford for now. :)

Top answer

1 of 5

It's pretty good. But good damn, it's like it's on cocaine, does way to much and never stops

2 of 5

Claude more often that not can solve really hairy bugs better than gemini or chatgpt but there are some caveats tends to bloat code with over engineered structure which can fuck you up down the line, and also eat up your token limit May add unnecessary funtionality which will also eat up your token limit Im under the impression that they do this on purpose to convince you to pay for the service, in my case it worked

reddit.com › r › claude

r/claude

December 6, 2013 - r/claude: Community for Anthropic's generative AI model, Claude.

reddit.com › r/claudecode › claude code is a beast – tips from 6 months of hardcore use

r/ClaudeCode on Reddit: Claude Code is a Beast – Tips from 6 Months of Hardcore Use

October 31, 2025 -

Edit: Many of you are asking for a repo so I will make an effort to get one up in the next couple days. All of this is a part of a work project at the moment, so I have to take some time to copy everything into a fresh project and scrub any identifying info. I will post the link here when it's up. You can also follow me and I will post it on my profile so you get notified. Thank you all for the kind comments. I'm happy to share this info with others since I don't get much chance to do so in my day-to-day.

Edit (final?): I bit the bullet and spent the afternoon getting a github repo up for you guys. Just made a post with some additional info here or you can go straight to the source:

🎯 Repository: https://github.com/diet103/claude-code-infrastructure-showcase

Quick tip from a fellow lazy person: You can throw this book of a post into one of the many text-to-speech AI services like ElevenLabs Reader or Natural Reader and have it read the post for you :)

Disclaimer

I made a post about six months ago sharing my experience after a week of hardcore use with Claude Code. It's now been about six months of hardcore use, and I would like to share some more tips, tricks, and word vomit with you all. I may have went a little overboard here so strap in, grab a coffee, sit on the toilet or whatever it is you do when doom-scrolling reddit.

I want to start the post off with a disclaimer: all the content within this post is merely me sharing what setup is working best for me currently and should not be taken as gospel or the only correct way to do things. It's meant to hopefully inspire you to improve your setup and workflows with AI agentic coding. I'm just a guy, and this is just like, my opinion, man.

Also, I'm on the 20x Max plan, so your mileage may vary. And if you're looking for vibe-coding tips, you should look elsewhere. If you want the best out of CC, then you should be working together with it: planning, reviewing, iterating, exploring different approaches, etc.

Quick Overview

After 6 months of pushing Claude Code to its limits (solo rewriting 300k LOC), here's the system I built:

Skills that actually auto-activate when needed
Dev docs workflow that prevents Claude from losing the plot
PM2 + hooks for zero-errors-left-behind
Army of specialized agents for reviews, testing, and planning Let's get into it.

Background

I'm a software engineer who has been working on production web apps for the last seven years or so. And I have fully embraced the wave of AI with open arms. I'm not too worried about AI taking my job anytime soon, as it is a tool that I use to leverage my capabilities. In doing so, I have been building MANY new features and coming up with all sorts of new proposal presentations put together with Claude and GPT-5 Thinking to integrate new AI systems into our production apps. Projects I would have never dreamt of having the time to even consider before integrating AI into my workflow. And with all that, I'm giving myself a good deal of job security and have become the AI guru at my job since everyone else is about a year or so behind on how they're integrating AI into their day-to-day.

With my newfound confidence, I proposed a pretty large redesign/refactor of one of our web apps used as an internal tool at work. This was a pretty rough college student-made project that was forked off another project developed by me as an intern (created about 7 years ago and forked 4 years ago). This may have been a bit overly ambitious of me since, to sell it to the stakeholders, I agreed to finish a top-down redesign of this fairly decent-sized project (~100k LOC) in a matter of two to three months...all by myself. I knew going in that I was going to have to put in extra hours to get this done, even with the help of CC. But deep down, I know it's going to be a hit, automating several manual processes and saving a lot of time for a lot of people at the company.

It's now six months later... yeah, I probably should not have agreed to this timeline. I have tested the limits of both Claude as well as my own sanity trying to get this thing done. I completely scrapped the old frontend, as everything was seriously outdated and I wanted to play with the latest and greatest. I'm talkin' React 16 JS → React 19 TypeScript, React Query v2 → TanStack Query v5, React Router v4 w/ hashrouter → TanStack Router w/ file-based routing, Material UI v4 → MUI v7, all with strict adherence to best practices. The project is now at ~300-400k LOC and my life expectancy ~5 years shorter. It's finally ready to put up for testing, and I am incredibly happy with how things have turned out.

This used to be a project with insurmountable tech debt, ZERO test coverage, HORRIBLE developer experience (testing things was an absolute nightmare), and all sorts of jank going on. I addressed all of those issues with decent test coverage, manageable tech debt, and implemented a command-line tool for generating test data as well as a dev mode to test different features on the frontend. During this time, I have gotten to know CC's abilities and what to expect out of it.

A Note on Quality and Consistency

I've noticed a recurring theme in forums and discussions - people experiencing frustration with usage limits and concerns about output quality declining over time. I want to be clear up front: I'm not here to dismiss those experiences or claim it's simply a matter of "doing it wrong." Everyone's use cases and contexts are different, and valid concerns deserve to be heard.

That said, I want to share what's been working for me. In my experience, CC's output has actually improved significantly over the last couple of months, and I believe that's largely due to the workflow I've been constantly refining. My hope is that if you take even a small bit of inspiration from my system and integrate it into your CC workflow, you'll give it a better chance at producing quality output that you're happy with.

Now, let's be real - there are absolutely times when Claude completely misses the mark and produces suboptimal code. This can happen for various reasons. First, AI models are stochastic, meaning you can get widely varying outputs from the same input. Sometimes the randomness just doesn't go your way, and you get an output that's legitimately poor quality through no fault of your own. Other times, it's about how the prompt is structured. There can be significant differences in outputs given slightly different wording because the model takes things quite literally. If you misword or phrase something ambiguously, it can lead to vastly inferior results.

Sometimes You Just Need to Step In

Look, AI is incredible, but it's not magic. There are certain problems where pattern recognition and human intuition just win. If you've spent 30 minutes watching Claude struggle with something that you could fix in 2 minutes, just fix it yourself. No shame in that. Think of it like teaching someone to ride a bike - sometimes you just need to steady the handlebars for a second before letting go again.

I've seen this especially with logic puzzles or problems that require real-world common sense. AI can brute-force a lot of things, but sometimes a human just "gets it" faster. Don't let stubbornness or some misguided sense of "but the AI should do everything" waste your time. Step in, fix the issue, and keep moving.

I've had my fair share of terrible prompting, which usually happens towards the end of the day where I'm getting lazy and I'm not putting that much effort into my prompts. And the results really show. So next time you are having these kinds of issues where you think the output is way worse these days because you think Anthropic shadow-nerfed Claude, I encourage you to take a step back and reflect on how you are prompting.

Re-prompt often. You can hit double-esc to bring up your previous prompts and select one to branch from. You'd be amazed how often you can get way better results armed with the knowledge of what you don't want when giving the same prompt. All that to say, there can be many reasons why the output quality seems to be worse, and it's good to self-reflect and consider what you can do to give it the best possible chance to get the output you want.

As some wise dude somewhere probably said, "Ask not what Claude can do for you, ask what context you can give to Claude" ~ Wise Dude

Alright, I'm going to step down from my soapbox now and get on to the good stuff.

My System

I've implemented a lot changes to my workflow as it relates to CC over the last 6 months, and the results have been pretty great, IMO.

Skills Auto-Activation System (Game Changer!)

This one deserves its own section because it completely transformed how I work with Claude Code.

The Problem

So Anthropic releases this Skills feature, and I'm thinking "this looks awesome!" The idea of having these portable, reusable guidelines that Claude can reference sounded perfect for maintaining consistency across my massive codebase. I spent a good chunk of time with Claude writing up comprehensive skills for frontend development, backend development, database operations, workflow management, etc. We're talking thousands of lines of best practices, patterns, and examples.

And then... nothing. Claude just wouldn't use them. I'd literally use the exact keywords from the skill descriptions. Nothing. I'd work on files that should trigger the skills. Nothing. It was incredibly frustrating because I could see the potential, but the skills just sat there like expensive decorations.

The "Aha!" Moment

That's when I had the idea of using hooks. If Claude won't automatically use skills, what if I built a system that MAKES it check for relevant skills before doing anything?

So I dove into Claude Code's hook system and built a multi-layered auto-activation architecture with TypeScript hooks. And it actually works!

How It Works

I created two main hooks:

1. UserPromptSubmit Hook (runs BEFORE Claude sees your message):

Analyzes your prompt for keywords and intent patterns
Checks which skills might be relevant
Injects a formatted reminder into Claude's context
Now when I ask "how does the layout system work?" Claude sees a big "🎯 SKILL ACTIVATION CHECK - Use project-catalog-developer skill" (project catalog is a large complex data grid based feature on my front end) before even reading my question

2. Stop Event Hook (runs AFTER Claude finishes responding):

Analyzes which files were edited
Checks for risky patterns (try-catch blocks, database operations, async functions)
Displays a gentle self-check reminder
"Did you add error handling? Are Prisma operations using the repository pattern?"
Non-blocking, just keeps Claude aware without being annoying

skill-rules.json Configuration

I created a central configuration file that defines every skill with:

Keywords: Explicit topic matches ("layout", "workflow", "database")
Intent patterns: Regex to catch actions ("(create|add).*?(feature|route)")
File path triggers: Activates based on what file you're editing
Content triggers: Activates if file contains specific patterns (Prisma imports, controllers, etc.)

Example snippet:

{
  "backend-dev-guidelines": {
    "type": "domain",
    "enforcement": "suggest",
    "priority": "high",
    "promptTriggers": {
      "keywords": ["backend", "controller", "service", "API", "endpoint"],
      "intentPatterns": [
        "(create|add).*?(route|endpoint|controller)",
        "(how to|best practice).*?(backend|API)"
      ]
    },
    "fileTriggers": {
      "pathPatterns": ["backend/src/**/*.ts"],
      "contentPatterns": ["router\\.", "export.*Controller"]
    }
  }
}

The Results

Now when I work on backend code, Claude automatically:

Sees the skill suggestion before reading my prompt
Loads the relevant guidelines
Actually follows the patterns consistently
Self-checks at the end via gentle reminders

The difference is night and day. No more inconsistent code. No more "wait, Claude used the old pattern again." No more manually telling it to check the guidelines every single time.

Following Anthropic's Best Practices (The Hard Way)

After getting the auto-activation working, I dove deeper and found Anthropic's official best practices docs. Turns out I was doing it wrong because they recommend keeping the main SKILL.md file under 500 lines and using progressive disclosure with resource files.

Whoops. My frontend-dev-guidelines skill was 1,500+ lines. And I had a couple other skills over 1,000 lines. These monolithic files were defeating the whole purpose of skills (loading only what you need).

So I restructured everything:

frontend-dev-guidelines: 398-line main file + 10 resource files
backend-dev-guidelines: 304-line main file + 11 resource files

Now Claude loads the lightweight main file initially, and only pulls in detailed resource files when actually needed. Token efficiency improved 40-60% for most queries.

Skills I've Created

Here's my current skill lineup:

Guidelines & Best Practices:

backend-dev-guidelines - Routes → Controllers → Services → Repositories
frontend-dev-guidelines - React 19, MUI v7, TanStack Query/Router patterns
skill-developer - Meta-skill for creating more skills

Domain-Specific:

workflow-developer - Complex workflow engine patterns
notification-developer - Email/notification system
database-verification - Prevent column name errors (this one is a guardrail that actually blocks edits!)
project-catalog-developer - DataGrid layout system

All of these automatically activate based on what I'm working on. It's like having a senior dev who actually remembers all the patterns looking over Claude's shoulder.

Why This Matters

Before skills + hooks:

Claude would use old patterns even though I documented new ones
Had to manually tell Claude to check BEST_PRACTICES.md every time
Inconsistent code across the 300k+ LOC codebase
Spent too much time fixing Claude's "creative interpretations"

After skills + hooks:

Consistent patterns automatically enforced
Claude self-corrects before I even see the code
Can trust that guidelines are being followed
Way less time spent on reviews and fixes

If you're working on a large codebase with established patterns, I cannot recommend this system enough. The initial setup took a couple of days to get right, but it's paid for itself ten times over.

CLAUDE.md and Documentation Evolution

In a post I wrote 6 months ago, I had a section about rules being your best friend, which I still stand by. But my CLAUDE.md file was quickly getting out of hand and was trying to do too much. I also had this massive BEST_PRACTICES.md file (1,400+ lines) that Claude would sometimes read and sometimes completely ignore.

So I took an afternoon with Claude to consolidate and reorganize everything into a new system. Here's what changed:

What Moved to Skills

Previously, BEST_PRACTICES.md contained:

TypeScript standards
React patterns (hooks, components, suspense)
Backend API patterns (routes, controllers, services)
Error handling (Sentry integration)
Database patterns (Prisma usage)
Testing guidelines
Performance optimization

All of that is now in skills with the auto-activation hook ensuring Claude actually uses them. No more hoping Claude remembers to check BEST_PRACTICES.md.

What Stayed in CLAUDE.md

Now CLAUDE.md is laser-focused on project-specific info (only ~200 lines):

Quick commands (pnpm pm2:start, pnpm build, etc.)
Service-specific configuration
Task management workflow (dev docs system)
Testing authenticated routes
Workflow dry-run mode
Browser tools configuration

The New Structure

Root CLAUDE.md (100 lines)
├── Critical universal rules
├── Points to repo-specific claude.md files
└── References skills for detailed guidelines

Each Repo's claude.md (50-100 lines)
├── Quick Start section pointing to:
│   ├── PROJECT_KNOWLEDGE.md - Architecture & integration
│   ├── TROUBLESHOOTING.md - Common issues
│   └── Auto-generated API docs
└── Repo-specific quirks and commands

The magic: Skills handle all the "how to write code" guidelines, and CLAUDE.md handles "how this specific project works." Separation of concerns for the win.

Dev Docs System

This system, out of everything (besides skills), I think has made the most impact on the results I'm getting out of CC. Claude is like an extremely confident junior dev with extreme amnesia, losing track of what they're doing easily. This system is aimed at solving those shortcomings.

The dev docs section from my CLAUDE.md:

### Starting Large Tasks

When exiting plan mode with an accepted plan: 1.**Create Task Directory**:
mkdir -p ~/git/project/dev/active/[task-name]/

2.**Create Documents**:

- `[task-name]-plan.md` - The accepted plan
- `[task-name]-context.md` - Key files, decisions
- `[task-name]-tasks.md` - Checklist of work

3.**Update Regularly**: Mark tasks complete immediately

### Continuing Tasks

- Check `/dev/active/` for existing tasks
- Read all three files before proceeding
- Update "Last Updated" timestamps

These are documents that always get created for every feature or large task. Before using this system, I had many times when I all of a sudden realized that Claude had lost the plot and we were no longer implementing what we had planned out 30 minutes earlier because we went off on some tangent for whatever reason.

My Planning Process

My process starts with planning. Planning is king. If you aren't at a minimum using planning mode before asking Claude to implement something, you're gonna have a bad time, mmm'kay. You wouldn't have a builder come to your house and start slapping on an addition without having him draw things up first.

When I start planning a feature, I put it into planning mode, even though I will eventually have Claude write the plan down in a markdown file. I'm not sure putting it into planning mode necessary, but to me, it feels like planning mode gets better results doing the research on your codebase and getting all the correct context to be able to put together a plan.

I created a strategic-plan-architect subagent that's basically a planning beast. It:

Gathers context efficiently
Analyzes project structure
Creates comprehensive structured plans with executive summary, phases, tasks, risks, success metrics, timelines
Generates three files automatically: plan, context, and tasks checklist

But I find it really annoying that you can't see the agent's output, and even more annoying is if you say no to the plan, it just kills the agent instead of continuing to plan. So I also created a custom slash command (/dev-docs) with the same prompt to use on the main CC instance.

Once Claude spits out that beautiful plan, I take time to review it thoroughly. This step is really important. Take time to understand it, and you'd be surprised at how often you catch silly mistakes or Claude misunderstanding a very vital part of the request or task.

More often than not, I'll be at 15% context left or less after exiting plan mode. But that's okay because we're going to put everything we need to start fresh into our dev docs. Claude usually likes to just jump in guns blazing, so I immediately slap the ESC key to interrupt and run my /dev-docs slash command. The command takes the approved plan and creates all three files, sometimes doing a bit more research to fill in gaps if there's enough context left.

And once I'm done with that, I'm pretty much set to have Claude fully implement the feature without getting lost or losing track of what it was doing, even through an auto-compaction. I just make sure to remind Claude every once in a while to update the tasks as well as the context file with any relevant context. And once I'm running low on context in the current session, I just run my slash command /update-dev-docs. Claude will note any relevant context (with next steps) as well as mark any completed tasks or add new tasks before I compact the conversation. And all I need to say is "continue" in the new session.

During implementation, depending on the size of the feature or task, I will specifically tell Claude to only implement one or two sections at a time. That way, I'm getting the chance to go in and review the code in between each set of tasks. And periodically, I have a subagent also reviewing the changes so I can catch big mistakes early on. If you aren't having Claude review its own code, then I highly recommend it because it saved me a lot of headaches catching critical errors, missing implementations, inconsistent code, and security flaws.

PM2 Process Management (Backend Debugging Game Changer)

This one's a relatively recent addition, but it's made debugging backend issues so much easier.

The Problem

My project has seven backend microservices running simultaneously. The issue was that Claude didn't have access to view the logs while services were running. I couldn't just ask "what's going wrong with the email service?" - Claude couldn't see the logs without me manually copying and pasting them into chat.

The Intermediate Solution

For a while, I had each service write its output to a timestamped log file using a devLog script. This worked... okay. Claude could read the log files, but it was clunky. Logs weren't real-time, services wouldn't auto-restart on crashes, and managing everything was a pain.

The Real Solution: PM2

Then I discovered PM2, and it was a game changer. I configured all my backend services to run via PM2 with a single command: pnpm pm2:start

What this gives me:

Each service runs as a managed process with its own log file
Claude can easily read individual service logs in real-time
Automatic restarts on crashes
Real-time monitoring with pm2 logs
Memory/CPU monitoring with pm2 monit
Easy service management (pm2 restart email, pm2 stop all, etc.)

PM2 Configuration:

// ecosystem.config.jsmodule.exports = {
  apps: [
    {
      name: 'form-service',
      script: 'npm',
      args: 'start',
      cwd: './form',
      error_file: './form/logs/error.log',
      out_file: './form/logs/out.log',
    },
// ... 6 more services
  ]
};

Before PM2:

Me: "The email service is throwing errors"
Me: [Manually finds and copies logs]
Me: [Pastes into chat]
Claude: "Let me analyze this..."

The debugging workflow now:

Me: "The email service is throwing errors"
Claude: [Runs] pm2 logs email --lines 200
Claude: [Reads the logs] "I see the issue - database connection timeout..."
Claude: [Runs] pm2 restart email
Claude: "Restarted the service, monitoring for errors..."

Night and day difference. Claude can autonomously debug issues now without me being a human log-fetching service.

One caveat: Hot reload doesn't work with PM2, so I still run the frontend separately with pnpm dev. But for backend services that don't need hot reload as often, PM2 is incredible.

Hooks System (#NoMessLeftBehind)

The project I'm working on is multi-root and has about eight different repos in the root project directory. One for the frontend and seven microservices and utilities for the backend. I'm constantly bouncing around making changes in a couple of repos at a time depending on the feature.

And one thing that would annoy me to no end is when Claude forgets to run the build command in whatever repo it's editing to catch errors. And it will just leave a dozen or so TypeScript errors without me catching it. Then a couple of hours later I see Claude running a build script like a good boy and I see the output: "There are several TypeScript errors, but they are unrelated, so we're all good here!"

No, we are not good, Claude.

Hook #1: File Edit Tracker

First, I created a post-tool-use hook that runs after every Edit/Write/MultiEdit operation. It logs:

Which files were edited
What repo they belong to
Timestamps

Initially, I made it run builds immediately after each edit, but that was stupidly inefficient. Claude makes edits that break things all the time before quickly fixing them.

Hook #2: Build Checker

Then I added a Stop hook that runs when Claude finishes responding. It:

Reads the edit logs to find which repos were modified
Runs build scripts on each affected repo
Checks for TypeScript errors
If < 5 errors: Shows them to Claude
If ≥ 5 errors: Recommends launching auto-error-resolver agent
Logs everything for debugging

Since implementing this system, I've not had a single instance where Claude has left errors in the code for me to find later. The hook catches them immediately, and Claude fixes them before moving on.

Hook #3: Prettier Formatter

This one's simple but effective. After Claude finishes responding, automatically format all edited files with Prettier using the appropriate .prettierrc config for that repo.

No more going into to manually edit a file just to have prettier run and produce 20 changes because Claude decided to leave off trailing commas last week when we created that file.

⚠️ Update: I No Longer Recommend This Hook

After publishing, a reader shared detailed data showing that file modifications trigger <system-reminder> notifications that can consume significant context tokens. In their case, Prettier formatting led to 160k tokens consumed in just 3 rounds due to system-reminders showing file diffs.

While the impact varies by project (large files and strict formatting rules are worst-case scenarios), I'm removing this hook from my setup. It's not a big deal to let formatting happen when you manually edit files anyway, and the potential token cost isn't worth the convenience.

If you want automatic formatting, consider running Prettier manually between sessions instead of during Claude conversations.

Hook #4: Error Handling Reminder

This is the gentle philosophy hook I mentioned earlier:

Analyzes edited files after Claude finishes
Detects risky patterns (try-catch, async operations, database calls, controllers)
Shows a gentle reminder if risky code was written
Claude self-assesses whether error handling is needed
No blocking, no friction, just awareness

Example output:

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
📋 ERROR HANDLING SELF-CHECK
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

⚠️  Backend Changes Detected
   2 file(s) edited

   ❓ Did you add Sentry.captureException() in catch blocks?
   ❓ Are Prisma operations wrapped in error handling?

   💡 Backend Best Practice:
      - All errors should be captured to Sentry
      - Controllers should extend BaseController
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

The Complete Hook Pipeline

Here's what happens on every Claude response now:

Claude finishes responding
  ↓
Hook 1: Prettier formatter runs → All edited files auto-formatted
  ↓
Hook 2: Build checker runs → TypeScript errors caught immediately
  ↓
Hook 3: Error reminder runs → Gentle self-check for error handling
  ↓
If errors found → Claude sees them and fixes
  ↓
If too many errors → Auto-error-resolver agent recommended
  ↓
Result: Clean, formatted, error-free code

And the UserPromptSubmit hook ensures Claude loads relevant skills BEFORE even starting work.

No mess left behind. It's beautiful.

Scripts Attached to Skills

One really cool pattern I picked up from Anthropic's official skill examples on GitHub: attach utility scripts to skills.

For example, my backend-dev-guidelines skill has a section about testing authenticated routes. Instead of just explaining how authentication works, the skill references an actual script:

### Testing Authenticated Routes

Use the provided test-auth-route.js script:


`node scripts/test-auth-route.js http://localhost:3002/api/endpoint`

The script handles all the complex authentication steps for you:

Gets a refresh token from Keycloak
Signs the token with JWT secret
Creates cookie header
Makes authenticated request

When Claude needs to test a route, it knows exactly what script to use and how to use it. No more "let me create a test script" and reinventing the wheel every time.

I'm planning to expand this pattern - attach more utility scripts to relevant skills so Claude has ready-to-use tools instead of generating them from scratch.

Tools and Other Things

SuperWhisper on Mac

Voice-to-text for prompting when my hands are tired from typing. Works surprisingly well, and Claude understands my rambling voice-to-text surprisingly well.

Memory MCP

I use this less over time now that skills handle most of the "remembering patterns" work. But it's still useful for tracking project-specific decisions and architectural choices that don't belong in skills.

BetterTouchTool

Relative URL copy from Cursor (for sharing code references)
- I have VSCode open to more easily find the files I’m looking for and I can double tap CAPS-LOCK, then BTT inputs the shortcut to copy relative URL, transforms the clipboard contents by prepending an ‘@’ symbol, focuses the terminal, and then pastes the file path. All in one.
Double-tap hotkeys to quickly focus apps (CMD+CMD = Claude Code, OPT+OPT = Browser)
Custom gestures for common actions

Honestly, the time savings on just not fumbling between apps is worth the BTT purchase alone.

Scripts for Everything

If there's any annoying tedious task, chances are there's a script for that:

Command-line tool to generate mock test data. Before using Claude code, it was extremely annoying to generate mock data because I would have to make a submission to a form that had about a 120 questions Just to generate one single test submission.
Authentication testing scripts (get tokens, test routes)
Database resetting and seeding
Schema diff checker before migrations
Automated backup and restore for dev database

Pro tip: When Claude helps you write a useful script, immediately document it in CLAUDE.md or attach it to a relevant skill. Future you will thank past you.

Documentation (Still Important, But Evolved)

I think next to planning, documentation is almost just as important. I document everything as I go in addition to the dev docs that are created for each task or feature. From system architecture to data flow diagrams to actual developer docs and APIs, just to name a few.

But here's what changed: Documentation now works WITH skills, not instead of them.

Skills contain: Reusable patterns, best practices, how-to guides Documentation contains: System architecture, data flows, API references, integration points

For example:

"How to create a controller" → backend-dev-guidelines skill
"How our workflow engine works" → Architecture documentation
"How to write React components" → frontend-dev-guidelines skill
"How notifications flow through the system" → Data flow diagram + notification skill

I still have a LOT of docs (850+ markdown files), but now they're laser-focused on project-specific architecture rather than repeating general best practices that are better served by skills.

You don't necessarily have to go that crazy, but I highly recommend setting up multiple levels of documentation. Ones for broad architectural overview of specific services, wherein you'll include paths to other documentation that goes into more specifics of different parts of the architecture. It will make a major difference on Claude's ability to easily navigate your codebase.

Prompt Tips

When you're writing out your prompt, you should try to be as specific as possible about what you are wanting as a result. Once again, you wouldn't ask a builder to come out and build you a new bathroom without at least discussing plans, right?

"You're absolutely right! Shag carpet probably is not the best idea to have in a bathroom."

Sometimes you might not know the specifics, and that's okay. If you don't ask questions, tell Claude to research and come back with several potential solutions. You could even use a specialized subagent or use any other AI chat interface to do your research. The world is your oyster. I promise you this will pay dividends because you will be able to look at the plan that Claude has produced and have a better idea if it's good, bad, or needs adjustments. Otherwise, you're just flying blind, pure vibe-coding. Then you're gonna end up in a situation where you don't even know what context to include because you don't know what files are related to the thing you're trying to fix.

Try not to lead in your prompts if you want honest, unbiased feedback. If you're unsure about something Claude did, ask about it in a neutral way instead of saying, "Is this good or bad?" Claude tends to tell you what it thinks you want to hear, so leading questions can skew the response. It's better to just describe the situation and ask for thoughts or alternatives. That way, you'll get a more balanced answer.

Agents, Hooks, and Slash Commands (The Holy Trinity)

Agents

I've built a small army of specialized agents:

Quality Control:

code-architecture-reviewer - Reviews code for best practices adherence
build-error-resolver - Systematically fixes TypeScript errors
refactor-planner - Creates comprehensive refactoring plans

Testing & Debugging:

auth-route-tester - Tests backend routes with authentication
auth-route-debugger - Debugs 401/403 errors and route issues
frontend-error-fixer - Diagnoses and fixes frontend errors

Planning & Strategy:

strategic-plan-architect - Creates detailed implementation plans
plan-reviewer - Reviews plans before implementation
documentation-architect - Creates/updates documentation

Specialized:

frontend-ux-designer - Fixes styling and UX issues
web-research-specialist - Researches issues along with many other things on the web
reactour-walkthrough-designer - Creates UI tours

The key with agents is to give them very specific roles and clear instructions on what to return. I learned this the hard way after creating agents that would go off and do who-knows-what and come back with "I fixed it!" without telling me what they fixed.

Hooks (Covered Above)

The hook system is honestly what ties everything together. Without hooks:

Skills sit unused
Errors slip through
Code is inconsistently formatted
No automatic quality checks

With hooks:

Skills auto-activate
Zero errors left behind
Automatic formatting
Quality awareness built-in

Slash Commands

I have quite a few custom slash commands, but these are the ones I use most:

Planning & Docs:

/dev-docs - Create comprehensive strategic plan
/dev-docs-update - Update dev docs before compaction
/create-dev-docs - Convert approved plan to dev doc files

Quality & Review:

/code-review - Architectural code review
/build-and-fix - Run builds and fix all errors

Testing:

/route-research-for-testing - Find affected routes and launch tests
/test-route - Test specific authenticated routes

The beauty of slash commands is they expand into full prompts, so you can pack a ton of context and instructions into a simple command. Way better than typing out the same instructions every time.

Conclusion

After six months of hardcore use, here's what I've learned:

The Essentials:

Plan everything - Use planning mode or strategic-plan-architect
Skills + Hooks - Auto-activation is the only way skills actually work reliably
Dev docs system - Prevents Claude from losing the plot
Code reviews - Have Claude review its own work
PM2 for backend - Makes debugging actually bearable

The Nice-to-Haves:

Specialized agents for common tasks
Slash commands for repeated workflows
Comprehensive documentation
Utility scripts attached to skills
Memory MCP for decisions

And that's about all I can think of for now. Like I said, I'm just some guy, and I would love to hear tips and tricks from everybody else, as well as any criticisms. Because I'm always up for improving upon my workflow. I honestly just wanted to share what's working for me with other people since I don't really have anybody else to share this with IRL (my team is very small, and they are all very slow getting on the AI train).

If you made it this far, thanks for taking the time to read. If you have questions about any of this stuff or want more details on implementation, happy to share. The hooks and skills system especially took some trial and error to get right, but now that it's working, I can't imagine going back.

TL;DR: Built an auto-activation system for Claude Code skills using TypeScript hooks, created a dev docs workflow to prevent context loss, and implemented PM2 + automated error checking. Result: Solo rewrote 300k LOC in 6 months with consistent quality.

Top answer

1 of 5

The beauty of Reddit: You click on a post thinking “another clickbait, terrible post” and end up finding a masterpiece of a post.

2 of 5

This - this is a fucking MASTERPIECE of "CLAUDE CODE 101". Thank you for sharing this - this is exactly the curve I followed, with incredible success. You're an incredibly unique case for Claude Code - a seasoned developer *embracing* AI tools, not shrugging them off as "stupid" or "a threat". This is exactly the way. 99% of gripes, questions, and issues faced in this subreddit can be answered with this post. This is not the information people want to hear, but it's the information they **need** to hear to achieve technical excellence.

Find elsewhere

Google Bing Mojeek

reddit.com › r/claudecode › claude code isn't getting worse. your codebase is just getting bigger

r/ClaudeCode on Reddit: Claude Code isn't getting worse. Your codebase is just getting bigger

September 26, 2025 -

Many people have noticed quality declining. Here's what I think is actually happening:

Most of us have been building the same project for weeks if not months now. Our codebases grew from a few thousand LOC to over 10k. CC doesn't have 1M token context and won't read all your files (trust me, I've tried).

It requires a different approach at scale.

Here's what stopped working for me:

Vague prompts without context
Assuming it knows your file structure
Quick instructions that worked with less than 20 files

What works for me now:

Start every prompt with: "Read these files first: "
Give surgical instructions: "In /api/chat.js line 45, modify the function to..."
Follow up with "Review your edit and it's integration into my app"

I used to spend 1 minute prompting and 30 minutes debugging. Now I spend 10 minutes writing detailed prompts and get working code immediately.

This is what shifted for me. Your codebase got complex. Claude Code needs onboarding like a new developer would. Give it context, be specific, verify outputs.

My success rate with this approach is now over 90% first try. For the ones that don't make it, it's just a few tweaks away.

Been using CC since launch, tried Cursor, Codex, Replit, everything else. For me Opus in CC is hands down the best, but codex is not far behind. Sometimes I will have codex be the reviewer, and CC the dev.

Anyone else find any other techniques that work for larger codebases?

Top answer

1 of 5

No. That’s not the problem that I had. I was working on project that was quite big and cc was working fine. Suddenly it stopped working fine and started working terrible. On the same codebase. It wasn’t a vibe code project. It was the same big project that it was to handle before August 14. Now a month later seems to be able to work again on the project. So, the problem is not a “a quite large project because a few months of vibe coding”. Cc has degraded their performance and now it’s coming back to what it was before.

2 of 5

Sorry, but I disagree. Literally having a way better experience with gpt 5 than with opus every single day.

reddit.com › r/anthropic › its crazy how bad claude has gotten over the past couple weeks

r/Anthropic on Reddit: Its crazy how bad claude has gotten over the past couple weeks

August 3, 2025 -

I don't think I've ever seen a product degrade so fast on its customers as I'm seeing with Claude Code.

Two weeks ago this could solve very complex problems and now today it can't even correctly change colors of an app.

Very disappointing to see what anthropic is doing to their customers

Top answer

1 of 5

posts like this, seeming to imply a declaration of something as fact, based on sample size of 1 anecdotal case from 1 person as evidence, just seems wild to me I have not seen any change in the model personally, every now and again it will suck at something, and it is often me not giving it enough context or just something it was not trained that well on. but this has always been the case. not a new thing. I agree with some of the other commenters tbh, skill issues

2 of 5

I have no issues while using the api pricing.

reddit.com › r/experienceddevs › how many people here use claude code?

r/ExperiencedDevs on Reddit: How many people here use Claude code?

July 18, 2025 -

I used to think cursor was pretty average and not super helpful, but Claude code with opus 4 takes longer and seems to be a lot better at generating quality code without needing to spec every single requirement.

I still do review the code but I feel like I’m trusting it more because the quality is better.

Interested to hear your thoughts

Top answer

1 of 5

I kept trying it and it's garbage . Unless I spend the same amount of time I would developing the damn things myself on designing dumbass .md AI prompt files. Github Copilot Edit mode and ChatGPT are more useful for my specific usecase. Claude code did write up an impressive raytracer and some nice looking web pages but.. Real life needs extremely specific things. Even then, my AI usage has been really tapered recently. And I'm one of the early adopters and I always kept switching models, even hosted opensource ones on my machine. All these stupid LLM fanboys on this sub will have the same réalisation I had: Shits not THAT effective irl. But it is good for learning and explaining stuff back to you.

2 of 5

How many people here would like to see an end to endless AI questions...?

reddit.com › r/anthropic › claude has been objectively dumbified

r/Anthropic on Reddit: Claude has been objectively dumbified

July 17, 2025 -

Claude has been objectively dumbified. In order to prove it, you only need to checkout previous branches of features you coded with CC; reset them; use the same prompts, and try to code them again with Claude. It will produce a lot more bugs than before. (and, in my case, it completely failed to build the feature it had built before, even after many iterations).

TL;DR: if you're considering buying the max subscription, I do not recommend buying it just yet. Wait until Anthropic can properly handle the increased traffic, and restore Claude back to its previous performance levels.

>> Come at me, Claude glazers, bots and virtual d riders.

I've been using the pro max 20x subscription for the last three months. I'm using both Claude Code and the regular UI interface to code, produce documentation, and debate coding solutions.

About a week ago, I started running into more and more bugs, and less and less quality responses from Claude, to the point that it started to look stupid and frustrating, and I found myself investing more time correcting it and debugging stupid bugs, than actually shipping features - the classical case of decreasing productivity for actual software engineers, where the only usefulness of an llm is to be used as snippet code producer and syntax Q&A.

I decided to objectively test Claude out, and went back to previous implemented feature branch, reset it to its starting point, got the prompts back (i have a gemini gem that i use to re-prompt engineer all my prompts before submitting them to claude, so getting all the prompts that i used to develop that feature was not hard at all: i just had to visit my gemini's history), and tried to fed the prompts, by the same order as before, to CC. Result: couldn't even get through the third prompt (~20% of the feature), since it produced much buggier code than before; i reset the branch, opened a new instance of CC, and tried again - only for it to produce bugs it hadn't produced before; i tried a thrid time, and the same thing happened.

It's objectively dumbified. the upvotes and downvotes in this subreddit are hilariously skewed by anthropic bots, and claude glazer that behave like religious fanatics.

Btw, I'm a claude fan; just not a fan of this version of claude, and the gaslighting, bot-based tactics employed by Anthropic. They should be transparent and honest about it.

EDIT: Anthropic finally admitted to having servers under strain, and actively trying to improve their infrastructure. In this post they addressed the limit rates that people also have been complaining about for the last week: https://www.reddit.com/r/ClaudeAI/comments/1m2gv92/weve_increased_api_rate_limits_for_claude_sonnet/

Hopefully, they'll also fix the reasoning skills soon. Once again: GASLIGHTING your own paying audience IS BAD! BE OPEN AND HONEST ABOUT IT, SO THAT PEOPLE WHO RELY ON YOUR PRODUCT ADJUST ACCORDINGLY, you greedy bastards !!!

Top answer

1 of 5

Yeah, I feel the same way. It seems like they deliberately reduce the quality of the current model to save compute, likely redirecting those resources toward training the next one. It’s a cycle. I think this might be a sign they’re getting ready to release a new model in the next couple of months.

2 of 5

For the last few years I have always sceptical of the 'my model got dumber posts'. However this last week I have noticed massive swings in capability with Opus. For a single day Opus seemed to be operating back at a Chatgpt 3 level. It couldn't even do simple CSS changes, where as the previous day it was building me beautiful complex pages. There was also a visceral 'understanding what I meant' difference where Claude regressed massively in understanding my intentions. This was on stand alone html pages so it wasn't a 'getting too complex' issue. However after that full day of frustration the next day it was completely back to it's old self and building things that stunned me again. There definitely WAS an issue, and people just writing these concerns off wholesale aren't being fair. For those of us using Claude Code all day every day you do become quite attuned to it's current capability level and when there is variable performance it is quite obvious.

reddit.com › r/anthropic › claude is back

r/Anthropic on Reddit: Claude is back

September 20, 2025 -

I complained here in the last few days, Claude was producing objectively poor or very poor code at times in the last few weeks. Producing bad code and not following instructions.

The last two days were great. One-shotted everything.

Artefact issues were also less than usual it seems (artefact not updating or showing the previous version). I still believe this part is shaky and could be improved.

Very happy about this, thanks for fixing the model. I am using Sonnet via the claude.ai UI, pro plan.

Top answer

1 of 5

Performance is erratic. It couldn’t do basic UI updates like spacing for me. I switched to gpt5 and 1 hour i had 6 updates finished.

2 of 5

You're asking to be in an abusive relationship.

reddit.com › r/chatgptcoding › [deleted by user]

Why is Claude 3.7 so good? : r/ChatGPTCoding

February 27, 2025 - I never knew cursor gave away a free premium for claude 3.7 thinking so when I used that instead of gemini 2.5 pro, I came to a whole new high. Like claude I'm not a fanboy but it's almost as if a scientist is sitting right on the other side. Like I was working on adopting ORBSLAM in to python.

reddit.com › r › Anthropic

Anthropic

February 4, 2023 - I tried Claude for the first time today and I’m shocked at how natural it feels to converse with this LLM. I don’t feel patronized or shamed in what I can or can’t say. I’ve discussed the same subjects and haven’t gotten a policy warning/disclaimer.

CBS News

cbsnews.com › moneywatch › reddit sues anthropic over alleged "scraping" of content to train claude

Reddit sues Anthropic over alleged "scraping" of user comments to train AI chatbot Claude - CBS News

June 4, 2025 - Social media platform Reddit sued the artificial intelligence company Anthropic on Wednesday, alleging that it is illegally "scraping" the comments of millions of Reddit users to train its chatbot Claude.

reddit.com › r/singularity › "claude code wrote 80% of its own code" - anthropic dev

r/singularity on Reddit: "Claude Code wrote 80% of its own code" - anthropic dev

May 8, 2025 -

I am listening to an interview at the moment with the developer who kicked off the claude code project internally (agentic SWE tool). He was asked how much of the code was actually generated by claude code itself and provided a pretty surprising number. Granted, humans still did the directing and definitely reviewed the code, but that is pretty wild.

If we look ahead a couple of years, it seems very plausible that these agents will be writing close to 99% of their own code, with humans providing the direction rather than jumping in - doing line-by-line work. Autonomous ML research agents are definitely fascinating and will be great, but these types of SWE agents (cline/CC/windsurf/etc), that are able to indefinitely build and improve themselves should lead to great gains for us as well.

Top answer

1 of 5

404

They'll eventually be doing 100% of their own coding, and after that they'll be doing so in ways that are not understandable by humans.

2 of 5

138

I just don't believe that. It's either a lie, or misrepresentation, or misinterpretation of what was going on. I'd you've ever coded with AI, after like 2000 lines of code, it can't keep track of everything. AI simply cannot maintain projects that complex/lengthy. At that point, the human is doing more work than the AI.

reddit.com › r/openai › okay yes, claude is better than chatgpt for now

r/OpenAI on Reddit: Okay yes, Claude is better than ChatGPT for now

April 11, 2024 -

Been a ChatGPT pro user since atleast 7 months. Been using it every single day for coding and other business tasks. I feel a bit sad to say that it has lost is charm to a certain extent. It's not as powerful as I feel Claude is right now. I was not quickly impressed by the claims people were making about Claude but then I went ahead created an account and gave it a couple of problems ChatGPT was struggling with and it handled it with expertise which I instantly felt. Kept using it for a while and for the problems ChatGPT 4o was behaving like 3.5, it gave me solutions which were grounded and clear. Debugging is much more robust with Sonnet.

I hope ChatGPT gets its grip back as it has got more incentives for pro users but since last two days Claude helped me save a couple of hours. I have begun thinking about migration, atleast for a time being. Or keep pro for both tools.

Wanted to put it out there.

Edit: I just subscribed to Claude Pro. Keeping both subscriptions for now. I have a couple of ongoing projects and I believe I have a use case for both. With the limits removed, I have worked on Claude more than ChatGPT, it's not been too long though, around an hour.

I may edit this post again in near future with my findings and for others to decide.

_________________________________________________________________

Edit: January 23rd, 2025.

It's been seven months since I first posted, which seems to rank high for Claude vs ChatGPT searches. I wanted to update on my journey as promised.

After switching from ChatGPT to Claude, I never looked back. My entire coding workflow shifted to Claude, specifically Claude 3.5 Sonnet. I started with Claude Chat directly, but when Cursor emerged, I tried it and found it to be the most efficient way to code using Sonnet. These days, I no longer maintain a Claude subscription and exclusively use Cursor.

I only resubscribed to ChatGPT last month (just one month and no more) for real-time voice chat (language learning). I still use it for basic tasks like grammar checks and searches - essentially as a replacement for Google and as a general AI assistant - but never for coding anymore.

For those finding this through Google: it's now well-established in the dev community that Claude 3.5 Sonnet is the most capable and intelligent coding LLM. Cursor's initial popularity was tied to Claude, but it has evolved into a powerful IDE with features like agent composer and much more.

For non-coders: Claude 3.5 Sonnet is, in my opinion, a far more intelligent and precise tool than GPT-4o and even o1. While I can't list all examples here, for every single non-coding task I've given it, I've received more refined, crafted, and precise responses.

This shift was a game-changer for my productivity and business gains. To tech founders and small teams building products: unless ChatGPT specifically fits your coding needs, consider switching to Cursor. It has literally transformed my business and boosted profits significantly. Grateful to the Claude team for their work.

____________________________________________________________________

Edit: Aug 20, 2025

Almost 6 months after writing the above, I want to update that I'm no longer using Cursor as my IDE, it's Claude Code now. This shift happened almost suddenly in June. Twitter was abuzz about Claude Code being much more powerful than Cursor, but I was reluctant since it's never easy to change your coding environment. But one night I thought, let me ask Claude Code to fix this issue I've been struggling with in Cursor. I loaded up $5 in my Anthropic dashboard, logged into my account with CC, and asked it to figure it out. And man, it was a breath of fresh air. It implemented such an elegant solution that Cursor wouldn't even come close to.

Now here's the kicker, in Cursor I was using the exact same model I used in CC: Sonnet 4. But as most devs know, Cursor uses various approaches to wrap, summarize, RAG and all that complex stuff. This, as many have experienced, ends up dumbing it down considerably. After this incident, I knew there was no going back. Especially when I discovered that you can use the same $20 subscription for both Claude web and Code.

So yeah, Claude Code is my main coding (and general) assistant now. I do use Claude web periodically, but with MCP support and other neat features in CC, it feels like it lives right in your machine rather than having to jump to a browser. And let me emphasize, it is significantly better than Cursor.

About GPT-5? Let me be brutally honest: it's disappointing. Doesn't come close to Claude. It's certainly better than GPT-4, but it feels like it's trying to be clever and intelligent without actually being so. On several occasions it gave me these polished-looking answers that turned out to be completely wrong. It also struggles to follow conversational context. I do use it regularly as a souped-up web search replacement, so it has its place. But the hype? Pure fluff.

Top answer

1 of 5

202

I also just switched to Claude yesterday and it helped me make an entire phone app. Incredibly more powerful and truly feels like it listens to what you say. It produced code of 1000 lines which took 4 continues, and each continue was perfectly where it last left off. It blew me away and I’d 110% recommend.

2 of 5

They're both good, and I use both regularly. Competition in this space is also good.

reddit.com › r/anthropic › why do people like claude better than chatgpt?

r/Anthropic on Reddit: Why do people like Claude better than ChatGPT?

February 25, 2025 -

I’ve heard that pretty consistently amongst colleagues but i don’t find the UX as good, it can’t access internet search and it doesn’t have unlimited data. Thoughts? What’s the upside? Genuinely curious. I’ve been trying to transition over but having a bit of a hard time of it.

Top answer

1 of 48

200k context window. Amazing prompt adherence/instruction following. Amazing at coding. UI is minimalistic, but transparent, you know what is actually doing when you upload a file, it does not truncate it or do RAG in the background like chatGPT.

2 of 48

The prose output is better.

reddit.com › r/chatgptcoding › anyone else feel let down by claude 4.

r/ChatGPTCoding on Reddit: Anyone else feel let down by Claude 4.

April 18, 2025 -

The 200k context window is deflating especially when gpt and gemini are eating them for lunch. Even if they went to 500k would be better.

Benchmarks at this point in the A.I game are negligible at best and you sure don't "Feel" a 1% difference between the 3. It feels like we are getting to the point of diminishing returns.

Us as programmers should be able to see the forest from the trees here. We think differently than the normal person. We think outside of the box. We don't get caught in hype as we exist in the realm of research, facts and practicality.

This Claude release is more hype than practical.

Top answer

1 of 5

Been playing with it a lot. I have the Max plan so using both opus with claude code and sonnet in chat. Been very impressed. Feels like a biggg jump from 3.7....I also sub to GPT and Google. And it feels way better than gemini 2.5 pro rn. And def better than gpt for complex tasks, coding, and writing (although I still use 4o and 4.1 the most for casual interactions, questions, quick brainstorming). Its really really impressing me with claude code. 3.7 was great and this feels like a decent jump. Not getting hung up nearly as much. Just my 2c

2 of 5

No chance I'm paying 5X for 1/5 the context window.

reddit.com › r/webdev › claude sonnet 4.5’s bold claims don’t match what software developers are seeing

r/webdev on Reddit: Claude Sonnet 4.5’s Bold Claims Don’t Match What Software Developers Are Seeing

August 29, 2025 - And in Claude Code having been previously spoiled by plan mode I noticed myself trusting 4.5 to plan and execute a lot more (sometimes out of shear laziness). And it’s been successful more frequently than I experienced with 4 alone.