My biggest gripe with Sonnet 3 and 3.5 was the usage limit in the webapp -- I had to cancel my subscription because it just couldn't keep up.
How is it now? Has anyone maxed it out yet? If it's even remotely better, I'll consider resubscribing.
I've been experimenting with Claude 3.7 and I'm genuinely impressed with its capabilities. The quality of responses and reasoning is excellent, especially for coding tasks. However, as a free user, I'm finding it practically unusable due to the severe rate limits.
I can only get through about 1-2 coding prompts per day before hitting the limit. This makes it impossible to have any meaningful ongoing development session or troubleshooting conversation.
I would happily pay for a subscription if the context window were significantly larger. The current 8k token limit is simply too restrictive for serious work. For comparison, I regularly use Gemini 2.0 Pro, which offers a 2 million token context window, allowing me to include entire codebases and documentation in my prompts. Look at Grok and o3-mini: both models are comparable in quality, and I get many times the usage as a free user. Grok 3 gives 50 normal prompts and 10 thinking prompts a day. With o3-mini I get unlimited 4o mini, tens of thousands of 4o tokens, and over a dozen o3 prompts, all without paying a dime, and all of these models have a much larger context window.
With just 8k tokens, I can barely fit a moderate-sized function and its related components before running out of space, let alone give Claude any frontend context. This means constantly having to reframe my questions and losing context, which makes complex programming tasks frustratingly inefficient.
Does anyone else feel the same way? I want to support Claude and would gladly pay for a better experience, but the current limitations make it hard to justify even for a paid tier.
Now that we have both 3.5 and 3.7, does anyone know what the limits are now? If 3.7 hits its limit, can we still use 3.5?
Since some people have been asking, here's the actual output limit for Sonnet 3.7 with and without thinking:
Non-thinking: 8192 tokens
Non-thinking chat: https://claude.ai/share/af0b52b3-efc3-452b-ad21-5e0f39676d9f
Thinking: 24196 tokens*
Thinking chat: https://claude.ai/share/c3c8cec3-2648-4ec4-a13d-c6cce7735a67
*The thinking token limit doesn't make a lot of sense to me, as I'd expect it to be 3 * 8192 = 24576, but close enough I guess. Also, in the example the thinking tokens alone come to 23575 before the main response is cut off, so thinking on its own may actually be able to run longer.
Tokens have been calculated with the token counting API and subtracting 16 tokens (role and some other tokens that are always present).
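As a rough sketch of the arithmetic described above (not the poster's actual script), the snippet below subtracts the fixed 16-token overhead from a raw count and compares the reported thinking limit with 3 * 8192. The function name `content_tokens` and the raw count 24212 are hypothetical, chosen only so the result matches the reported 24196.

```python
# Fixed overhead the post subtracts from every count
# (role and some other tokens that are always present).
OVERHEAD_TOKENS = 16


def content_tokens(raw_count: int, overhead: int = OVERHEAD_TOKENS) -> int:
    """Return the tokens attributable to the message text alone."""
    if raw_count < overhead:
        raise ValueError("raw count is smaller than the fixed overhead")
    return raw_count - overhead


# Limits as reported in the post.
NON_THINKING_LIMIT = 8192
THINKING_LIMIT = 24196

# Hypothetical raw count, picked to reproduce the reported thinking figure.
thinking = content_tokens(24212)

# The post's expectation was three times the non-thinking limit.
expected = 3 * NON_THINKING_LIMIT
gap = expected - THINKING_LIMIT

print(f"thinking tokens after overhead: {thinking}")
print(f"expected 3 * {NON_THINKING_LIMIT} = {expected}, short by {gap}")
```

The gap works out to 380 tokens, which is the "close enough" difference mentioned above.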
Hope this helps, and thanks to the Discord mod, who shall not be pinged, for the testing prompt.
We've just reset weekly limits for all Claude users on paid plans.
We've seen members of this community hitting their weekly usage limits more quickly than they might have expected. This is driven by usage of Opus 4.1, which can cause you to hit the limits much faster than Sonnet 4.5.
To help during this transition, we've reset weekly limits for all paid Claude users.
Our latest model, Sonnet 4.5, is now our best coding model and comes with much higher limits than Opus 4.1. We recommend switching your usage over from Opus if you want more usage. You will also get even better performance from Sonnet 4.5 by turning on "extended thinking" mode. In Claude Code, just use the tab key to toggle this mode on.
We appreciate that some of you have a strong affinity for our Opus models (we do too!). So we've added the ability to purchase extra usage if you're subscribed to the Max 20x plan. We’ll put together more guidance on choosing between our models in the coming weeks.
We value this community’s feedback. Please keep it coming – we want our models and products to work well for you.
This Megathread is to discuss your thoughts, concerns and suggestions about the changes involving the Weekly Usage Limits implemented alongside the recent Claude 4.5 release. Please help us keep all your feedback in one place so we can prepare a report for Anthropic's consideration about readers' suggestions, complaints and feedback. This also helps us to free the feed for other discussion. For discussion about recent Claude performance and bug reports, please use the Weekly Performance Megathread instead.
Please try to be as constructive as possible and include as much evidence as possible. Be sure to include what plan you are on. Feel free to link out to images.
Recent related Anthropic announcement: https://www.reddit.com/r/ClaudeAI/comments/1ntq8tv/introducing_claude_usage_limit_meter/
Original Anthropic announcement here: https://www.reddit.com/r/ClaudeAI/comments/1mbo1sb/updating_rate_limits_for_claude_subscription/