I use AI a lot in cases where I need a bit more than 16k of input (GPT-3.5's context window limit). GPT-3.5's performance is normally fine for me, but I have to use GPT-4 to get a longer context window, at a much higher inference price across the many queries I rack up over a long session.
The Claude 3 family is the first that seems to combine very respectable performance with long (200k) context windows across the entire lineup (Opus + Sonnet + Haiku). So I'm especially excited about the 'Sonnet' model (the middle-tier one).
TLDR: Opus's benchmark results are exciting, but I think Sonnet might enable more new real-world use cases than Opus once you factor in the context window and its relatively low cost.
I've tested it a few times, and when using Claude 3 Opus through Perplexity, the context length is definitely limited from 200k down to roughly 30k tokens.
On a 110k-token codebase, Claude 3 Opus through Perplexity would consistently (and I mean every one of 5 attempts) say that the last function in the program was one located about 30k tokens in.
When using Anthropic's API and their web chat, it consistently located the actual final function and could clearly see and recall all 110k tokens of the code.
I also tested this with 3 different books and 2 different codebases and received the same results across the board.
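For anyone who wants to reproduce this kind of probe against the API directly, here is a minimal sketch, assuming the anthropic Python SDK and an ANTHROPIC_API_KEY in the environment; the file path and the exact probe wording are placeholders, not the original poster's setup.

```python
# Minimal sketch of the "last function" probe described above.
# Assumption: the anthropic Python SDK is installed and ANTHROPIC_API_KEY is set.
import anthropic

client = anthropic.Anthropic()

# Placeholder path: dump the large codebase (or book) into a single file first.
with open("codebase_dump.py", encoding="utf-8") as f:
    code = f.read()

response = client.messages.create(
    model="claude-3-opus-20240229",
    max_tokens=200,
    messages=[{
        "role": "user",
        "content": code
        + "\n\nWhat is the name of the last function defined in the code above? "
          "Reply with the function name only.",
    }],
)

# If a provider silently truncates the prompt, the answer will name a function
# that appears early in the file instead of the true final one.
print(response.content[0].text)
```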
I understand if they have to limit the context to offer unlimited usage, but not saying that anywhere is a very disappointing marketing strategy. I've seen rumors of this, but I just wanted to add another data point confirming that the context window is limited to ~30k tokens.
Unlimited access to Claude 3 Opus is still pretty awesome, as long as you aren't hitting that context limit, but this gives me misgivings about what else Perplexity is doing to my prompts under the hood in the name of saving costs.
Hi everyone,
I'm considering using the Claude-3-Opus model on Poe, but I have a question about the context window size of the 1000-credit "shortened" version compared to the full 200k-token version (Claude-3-Opus-200k) that costs 6000 credits per message.
Since I'm located in Europe, I don't have direct access to Anthropic to use the full Opus model. So I'm trying to determine if the Poe version with the smaller context window will still meet my needs.
Does anyone happen to know approximately how many tokens the context window is limited to for Claude-3-Opus on Poe? Any insight would be greatly appreciated as I try to decide if it will be suitable for my use case.
Thanks so much for any info you can provide!
Poe just doubled the credit cost for Claude 3. 🤬 Now Claude-3-Opus-200k requires 12000 credits and Claude-3-Opus requires 2000 credits per message.
I have the same question about the nebulous context window when not using Opus-200k. Poe upped the price per message for Claude 3 Opus-200k a few days after its launch: it used to be 1750 credits/message and now it's 6000. The context window is important. It should be communicated...
I’m on Opus 4.5 with Max. Every time I add an image or try to do a slightly serious multi-step task, I get hit with “Context size exceeds the limit,” followed by the “Compacting our conversation so we can keep chatting…” spinner after just a few messages. I even tested with a simple single image and still had issues, which is super frustrating. I also tried reducing the number of files and the amount of content in the conversation.
It was absolutely on fire the last few days – long, complex sessions with multiple files, no issues at all. Then out of nowhere, it starts compacting almost immediately, even if I’m only working off a single image. With a supposed 200k+ context window, this makes zero sense from the user side.
I’ve tried pretty much everything: Opus 4.5 on Max, desktop app, web app, different projects/folders, disabling connectors, restarting, fresh chats, different prompt styles. Same story every time: the convo quickly gets butchered by aggressive compaction and length-limit warnings.
Is this some bug, server-side issue, or a quiet change to how they’re counting tokens, especially for images and file attachments? Anyone figured out a reliable workaround beyond “new chat every few minutes” or stripping everything down to plain text?
Would love to hear if others are seeing the same thing or if there’s a smarter way to work around these context shenanigans.
I am on the $100 plan using opus 4.5. Good experience so far but I am noticing that I am running out of context WAY faster. Not sure if this is because of Opus, because of my project, or because I downgraded from the $200 plan. Any ideas?
After the issues that had been plaguing me due to the general laziness of GPT-4, I let my subscription lapse and purchased a Claude 3 Opus subscription from Anthropic. At first I was simply amazed at how accurate the model was compared to the then-gimped GPT-4, though I quickly realized that the model and the underlying service had some key issues, such as the usage policy, which limits the number of prompts in a 5-hour period (at the time I signed up it was 8) if you upload certain files to it. Which I do quite frequently, since uploading a file makes it easier to provide some context for any task. So your 45-message limit can quickly become 10 if you don't understand how context affects the message limit. Furthermore, one of Claude's primary selling points, its large context, is effectively a curse of Tantalus in the sense that it is so close yet so far: we have 200k of context to play with, but due to the aforementioned usage policy we cannot make practical use of it.
Many will say to just use the API, but the costs are simply absurd if you intend to make the API version of Claude your daily driver. Also, Claude tends to be very verbose in its replies, and the UI of their flagship app leaves much to be desired. Finally, the lack of web browsing in Claude means you have to verify the output manually, and since Claude is so highly regarded for its intellect, you may end up trusting output you shouldn't.
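To put a rough number on those API costs, here is a back-of-the-envelope sketch. The pricing figures are assumptions based on Claude 3 Opus's launch-era list prices (about $15 per million input tokens and $75 per million output tokens), and the workflow assumed is a chat-style one that resends the growing conversation with every message.

```python
# Back-of-the-envelope estimate of using the Claude 3 Opus API as a daily driver.
# Assumed launch-era list prices: ~$15 / million input tokens, ~$75 / million output tokens.
INPUT_PER_MTOK = 15.00
OUTPUT_PER_MTOK = 75.00

def message_cost(context_tokens: int, output_tokens: int) -> float:
    """Cost of one chat turn that resends `context_tokens` of history."""
    return (context_tokens * INPUT_PER_MTOK + output_tokens * OUTPUT_PER_MTOK) / 1_000_000

# Example: a conversation carrying ~100k tokens of uploaded files and history,
# with ~1k tokens of (verbose) output per reply.
per_turn = message_cost(100_000, 1_000)
print(f"~${per_turn:.2f} per message")           # roughly $1.6 per turn
print(f"~${per_turn * 50:.2f} for 50 messages")  # roughly $79 for a day of heavy use
```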
Through it all I was prepared to keep my subscription, until the king returned with GPT-4 Turbo with Vision (2024-04-09), which fixed every major issue I had with the previous GPT-4 model I had originally left for Claude: clear, capable code and the ability to read files with an expanded context without issue. It became clear that even though Claude may be superior to GPT-4 in some ways, the scale of the underlying companies makes GPT-4 the superior choice. Not to mention, if it took the other companies this long to surpass a GPT-4 that was trained on lackluster hardware, what will GPT-5 look like?
I want to use Claude 3 Opus with the full 200k context window. I feel like no other AI even comes close to it in terms of creativity and realistic narration, which is what I want as a novelist wishing for an AI assistant. So far, though, I have been using C3O through Perplexity Pro only. The official Claude website doesn't have the option to edit/delete sent messages, so it is a big pain in the ass for my approach, as I try multiple prompts one by one to play out scenes and see which one is more suitable for the chapter and so on. Because of that, I opted for Perplexity over the official platform to access Claude 3 Opus. But Perplexity doesn't seem to have the 200k context window, and it caps C3O at 50 uses per day.
So I am now looking for an alternative way to access Claude 3 Opus with the full 200k context window, an edit/delete message option, and more than 50 uses per day (unlimited is the dream, but I know that is not an option yet for Opus).
There have been quite a few threads discussing how these two models perform, so I thought I'd share my experience.
I've managed to get a subscription for Claude via VPN, even though Europe is not yet supported. Thanks to Apple Pay, as I was unable to pay by card directly due to address restrictions.
After using it for a couple of days, mostly for Python coding and a little for writing, to my surprise, I actually found it better than GPT-4.
Since I can't think of any negative aspects, I'll list what I liked instead:
- I only hit the cap limit once, and it was probably due to a large context within the sessions rather than the message cap.
- It always performs as you ask it. It always returns complete functions, etc.
- The code it outputs almost always works the first time. Really nice!
- The code, overall, looks a bit nicer and is better organized, with more relevant variable names, etc.
Given that the price is the same, I think it's a much better deal. Well, at least for now, until we have GPT-4.5.
It's really worth a shot :)
I find myself reaching that cursed "Message limit reached for Claude 3 Opus" too often, and it's really frustrating because I've found Claude quite pleasant to interact and work with. I'm wondering, can't Anthropic at least provide the option to pay extra when needing to go over quota, rather than just being forced to stop in the middle of a productive conversation? Kind of like what phone companies do when you need more data than the package you've paid for allows...
Signed, everyone who uses Claude to write software. At least give us an option to pay for it.
Edit: thank you Anthropic!
I've looked around for an answer and I see a lot of conflicting claims. Some people say you only actually get the 200k context with Opus if you're using the API, and that normal Pro users just using Claude on the website only get a portion of that. I'm just curious if anyone knows the real answer.
The reason I ask is, I'm using Opus as a Pro member on the website, and even when I'm only like 15k tokens into a conversation, it starts telling me the conversation is getting too long and I should start a new one. 15k seems like a very tiny piece of 200k.
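If you want to check how many tokens a conversation actually contains before the "too long" warning appears, here is a rough sketch. It assumes a recent version of the anthropic Python SDK that exposes a token-counting endpoint; the model ID and message contents are placeholders, and if your SDK version lacks count_tokens, the common ~4-characters-per-token heuristic gives a ballpark instead.

```python
# Rough sketch: measure how many input tokens a conversation actually uses.
# Assumes a recent anthropic Python SDK that exposes messages.count_tokens;
# the model ID and message contents below are placeholders.
import anthropic

client = anthropic.Anthropic()

conversation = [
    {"role": "user", "content": "First long message of the conversation..."},
    {"role": "assistant", "content": "First long reply..."},
    {"role": "user", "content": "Latest message..."},
]

count = client.messages.count_tokens(
    model="claude-3-opus-20240229",
    messages=conversation,
)
print(count.input_tokens)

# Fallback heuristic if count_tokens isn't available in your SDK version:
approx = sum(len(m["content"]) for m in conversation) // 4
print(f"~{approx} tokens (rough 4-chars-per-token estimate)")
```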
With the introduction of Opus 4.5, Anthropic just updated the Claude Apps (Web, Desktop, Mobile):
For Claude app users, long conversations no longer hit a wall—Claude automatically summarizes earlier context as needed, so you can keep the chat going.
This is amazing. Hitting that wall was the only gripe I had with Claude (besides the usage limits), and the reason I kept using ChatGPT (for its rolling context window).
Anyone as happy as I am?
I’ve been using GPT-4 for quite a long time, and from what I observe it’s just stupid when it comes to the context window and memory handling, whereas Claude is actually brilliant.
I'm on the 20x Max plan. I get that Opus will use tokens faster, and Anthropic acknowledged this by increasing the total token allowance to something supposedly equivalent to the usage you'd get with Haiku (whether that's really true remains to be seen). However, they didn't raise the 200k context window limit (I don't have access to the 1M window).
I just used my first prompt (which was a pretty standard one for me) to help find an issue that threw an error on my front-end, and after its response (which wasn't that helpful), I'm already down to 9% context remaining before auto-compacting.
If Anthropic is going to acknowledge that token consumption will be higher with Opus and scale some of the limits up accordingly, they really should increase the context limit as well.
We have to be doing better than FELON TUSK, right? Right?
I'm wondering when it makes sense to access Claude 3 Opus directly through Anthropic or via Perplexity.
My main priority is the context window. If I can get the maximum context window via Perplexity, that seems like a good deal to me.
Edit: Perplexity answered this question. Context window is limited when accessing Claude via Perplexity: https://www.perplexity.ai/search/What-is-the-lLCnpBrnQJuNttxgo8aXbw