Claude Pro Users, how are the Sonnet 3.7 usage limits?

reddit.com › r › ClaudeAI › comments › 1iy12bw › claude_pro_users_how_are_the_sonnet_37_usage

They *seem* noticeably higher. I haven't hit them and I've been playing with it all day to see what it can do. I think I would've normally hit them a while ago - even with judicious use of MCP and resetting chats etc. Answer from CelebrationSecure510 on reddit.com

reddit.com › r/claudeai › claude pro users, how are the sonnet 3.7 usage limits?

r/ClaudeAI on Reddit: Claude Pro Users, how are the Sonnet 3.7 usage limits?

February 25, 2025 -

My biggest gripe with Sonnet 3 and 3.5 was the usage limit in the webapp -- I had to cancel my subscription because it just couldn't keep up.

How is it now? Has anyone maxed it out yet? If it's remotely better, then I'll consider resubscribing now..

Top answer

1 of 9

2 of 9

I have yet to max out, it’s been over a day of pretty consistent usage. Also its responses can be much much longer now if necessary (like for coding or creative writing) or anything else that would warrant a longer response. So far I tho l it’s worth it. But we’ll see if anything gets changed in the upcoming days.

reddit.com › r/claudeai › i love claude 3.7... just one problem: the rate limits make it unusable

r/ClaudeAI on Reddit: I love Claude 3.7... just one problem: The rate limits make it unusable

February 27, 2025 -

I've been experimenting with Claude 3.7 and I'm genuinely impressed with its capabilities. The quality of responses and reasoning is excellent, especially for coding tasks. However, as a free user, I'm finding it practically unusable due to the severe rate limits.

I can only get through about 1-2 coding prompts per day before hitting the limit. This makes it impossible to have any meaningful ongoing development session or troubleshooting conversation.

I would happily pay for a subscription if the context window was significantly larger. The current 8k token limit is simply too restrictive for serious work. For comparison, I regularly use Gemini 2.0 Pro which offers a 2 million token context window, allowing me to include entire codebases and documentation in my prompts. Look at grok and GPT-o3-mini, both models are comparable in terms of quality and i get many times the usage as a free user, grok 3 has 50 normal prompts a day and 10 thinking prompts a day, 03-mini gets unlimited 4o mini, tens of thousands of 4o tokens, and over a dozen 03 prompts, without paying a dime, all models having a much larger context window.

With just 8k tokens, I can barely fit a moderate-sized function and its related components before running out of space. Let along giving Claude frontend context. This means constantly having to reframe my questions and lose context, making complex programming tasks frustratingly inefficient.

Does anyone else feel the same way? I want to support Claude and would gladly pay for a better experience, but the current limitations make it hard to justify even for a paid tier.

Top answer

1 of 5

Let’s face it, you don’t want to/can’t afford the $20 for the 200K context token limit. I’m shocked at these people who don’t realize the amount of value that they get from these services. “It’s the greatest thing ever! But I don’t want to pay the price of a burger and fries every month!”

2 of 5

Claude has a 200k token context window. Its output tokens which were capped at 8192, though with 3.7 that goes up to 64.000 now.

Videos

08:24

YouTube

Testing Claude 3.7's Limits: I Built a Cursor Rules Scraper with ...

Claude Sonnet 3.7 is.. kinda bad? - YouTube

February 27, 2025

12:51

YouTube

Actually coding with Claude 3.7 is actually insane, actually. - ...

February 26, 2025

youtube.com

Build Anything with Claude 3.7, Here’s How - YouTube

February 25, 2025

09:32

YouTube

Claude Code Is Finally Here — And It’s INSANE! - YouTube

February 24, 2025

09:47

YouTube

Claude 3.7 Sonnet: The BEST Coding LLM Ever! (Fullly Tested) - ...

February 24, 2025

View all

Anthropic

anthropic.com › news › claude-3-7-sonnet

Claude 3.7 Sonnet and Claude Code

Second, when using Claude 3.7 Sonnet through the API, users can also control the budget for thinking: you can tell Claude to think for no more than N tokens, for any value of N up to its output limit of 128K tokens.

Simon Willison

simonwillison.net › 2025 › Feb › 25 › llm-anthropic-014

Claude 3.7 Sonnet, extended thinking and long output, llm-anthropic 0.14

February 25, 2025 - (This is the output limit—how much text it can produce in one go. Claude 3.7 Sonnet’s input limit remains 200,000—many modern models exceed 100,000 for input now.)

reddit.com › r/claudeai › sonnet 3.7 limits info?

r/ClaudeAI on Reddit: Sonnet 3.7 limits info?

August 11, 2024 -

Now that we have both 3.5 and 3.7, does anyone know what are the limits now? If 3.7 hits limit, can we use 3.5?

Top answer

1 of 8

The artefacts window seems longer, which is welcome news.

2 of 8

Probably the same limits. Also helps that the announcement post and video pretty much screamed at you to use the API. They really don't care about the claude.ai service.

reddit.com › r/claudeai › claude 3.7 output limit in ui

r/ClaudeAI on Reddit: Claude 3.7 output limit in UI

March 3, 2025 -

Since some people have been asking, here's the actual output limit for Sonnet 3.7 with and without thinking:
Non-thinking: 8192 tokens
Non-thinking chat: https://claude.ai/share/af0b52b3-efc3-452b-ad21-5e0f39676d9f

Thinking: 24196 tokens*
Thinking chat: https://claude.ai/share/c3c8cec3-2648-4ec4-a13d-c6cce7735a67

*The thinking tokens don't make a lot of sense to me, as I'd expect them to be 3 * 8192 = 24576, but close enough I guess. Also in the example the thinking tokens itself are 23575 before being cut off in the main response, so thinking alone may actually be longer.

Tokens have been calculated with the token counting API and subtracting 16 tokens (role and some other tokens that are always present).

Hope this helps and also thanks to the discord mod, that shall not be pinged, for the testing prompt.

Top answer

1 of 3

It's not 128k for thinking on output?

2 of 3

Very useful info. I believe in the API, if you explicitly specify max tokens and token thinking budget, it will aim to reach those rather than them being merely limits. Says in the docs somewhere

16x Prompt

prompt.16x.engineer › blog › claude-daily-usage-limit-quota

What's Claude AI Daily Usage Limit Quota? (Free vs Pro) | 16x Prompt

This opens up possibilities for ... quite generous compared to the web interface, with Tier 1 users having a limit of 50 requests per minute to Claude 3.5 Sonnet....

Anthropic

docs.anthropic.com › en › docs › about-claude › models › extended-thinking-models

Building with extended thinking - Anthropic

With Claude 3.7 and 4 models, max_tokens (which includes your thinking budget when thinking is enabled) is enforced as a strict limit. The system will now return a validation error if prompt tokens + max_tokens exceeds the context window size.

Find elsewhere

Google Bing Mojeek

Cursor

forum.cursor.com › discussions

Strong Discontent with Rate Limits on Claude 3.7 Sonnet for Slow Pool Users - Discussions - Cursor - Community Forum

December 21, 2024 - I would like to express my strong dissatisfaction with the current rate-limiting issues that are preventing slow pool users from accessing Claude 3.7 Sonnet. The model has immense potential, and it’s frustrating to see that users like myself, who are part of the slow pool, are unable to utilize ...

AWS

docs.aws.amazon.com › amazon bedrock › user guide › amazon bedrock foundation model information › inference request parameters and response fields for foundation models › anthropic claude models › *new* anthropic claude 3.7 sonnet

*NEW* Anthropic Claude 3.7 Sonnet - Amazon Bedrock

With Claude 3.7 Sonnet, max_tokens (which includes your thinking budget when thinking is enabled) is enforced as a strict limit. The system will now return a validation error if prompt tokens + max_tokens exceeds the context window size. When calculating context window usage with thinking enabled, ...

reddit.com › r/claudeai › update on usage limits

r/ClaudeAI on Reddit: Update on Usage Limits

October 1, 2025 -

We've just reset weekly limits for all Claude users on paid plans.

We've seen members of this community hitting their weekly usage limits more quickly than they might have expected. This is driven by usage of Opus 4.1, which can cause you to hit the limits much faster than Sonnet 4.5.

To help during this transition, we've reset weekly limits for all paid Claude users.

Our latest model, Sonnet 4.5 is now our best coding model and comes with much higher limits than Opus 4.1. We recommend switching your usage over from Opus, if you want more usage. You will also get even better performance from Sonnet 4.5 by turning on "extended thinking" mode. In Claude Code, just use the tab key to toggle this mode on.

We appreciate that some of you have a strong affinity for our Opus models (we do too!). So we've added the ability to purchase extra usage if you're subscribed to the Max 20x plan. We’ll put together more guidance on choosing between our models in the coming weeks.

We value this community’s feedback. Please keep it coming – we want our models and products to work well for you.

Top answer

1 of 5

174

Thank you, but can you confirm whether we still have access to 25-40 hours of Opus for typical use as stated in your documentation here: https://support.claude.com/en/articles/11145838-using-claude-code-with-your-pro-or-max-plan Can you confirm yes or no? So for typical use, single session with no subagents, can we expect to hit 25-40 hours of Opus? Also, Sonnet should provide 240-480 hours of typical use? Yes or no?

2 of 5

171

Well thanks for confirming its time to cancel subscription.

GitHub

github.com › anthropics › claude-code › issues › 15064

Claude AI usage limit reached and won't reset|1766502000 · Issue #15064 · anthropics/claude-code

3 days ago - Please try again later.\"},\"request_id\":\"req_011CWMy4AuP1JEoDD2ZvDnU6\"}\n at new M2 (unknown:1:28)\n at new hB (B:/~BUN/root/claude.exe:1150:5968)\n at new W00 (unknown:1:28)\n at generate (B:/~BUN/root/claude.exe:1150:6630)\n at makeRequest (B:/~BUN/root/claude.exe:1167:5327)\n at processTicksAndRejections (native:7:39)","timestamp":"2025-12-22T15:32:14.870Z"},{"error":"Error: 429 {\"type\":\"error\",\"error\":{\"type\":\"rate_limit_error\",\"message\":\"This request would exceed your account's rate limit.

Published Dec 22, 2025

Substack

simonw.substack.com › p › claude-37-sonnet-extended-thinking

Claude 3.7 Sonnet, extended thinking and long output

February 25, 2025 - (This is the output limit - how much text it can produce in one go. Claude 3.7 Sonnet's input limit remains 200,000 - many modern models exceed 100,000 for input now.)

TypingMind

blog.typingmind.com › home › claude rate exceeded: guide to fix and prevent the error

Claude Rate Exceeded: Guide to Fix and Prevent the Error

October 9, 2025 - If you encounter the “Rate Exceeded” error when using Claude, here are the most effective ways to resolve it immediately: Shorten your input prompt or reduce the maximum output tokens. Longer prompts and responses consume more tokens and can quickly exceed your per-minute token limit. If you’re making multiple calls to the API or through an automation workflow, add short delays between requests. Sending requests too quickly can trigger rate limits even when total usage is within quota.

reddit.com › r/claudeai › claude is giving us a 2x usage limit until december 31st. thank you, thank you.

r/ClaudeAI on Reddit: Claude is giving us a 2X usage limit until December 31st. Thank you, thank you.

1 day ago - Had I known this in advance I would have used up what I had. (Also right after the reset I accidently typed "quit;" instead of quit into Claud Code and it ate up 9% of my 5 hour quota.

reddit.com › r/claudeai › usage limits discussion megathread - beginning sep 30, 2025

r/ClaudeAI on Reddit: Usage Limits Discussion Megathread - beginning Sep 30, 2025

October 1, 2025 -

This Megathread is to discuss your thoughts, concerns and suggestions about the changes involving the Weekly Usage Limits implemented alongside the recent Claude 4.5 release. Please help us keep all your feedback in one place so we can prepare a report for Anthropic's consideration about readers' suggestions, complaints and feedback. This also helps us to free the feed for other discussion. For discussion about recent Claude performance and bug reports, please use the Weekly Performance Megathread instead.

Please try to be as constructive as possible and include as much evidence as possible. Be sure to include what plan you are on. Feel free to link out to images.

Recent related Anthropic announcement : https://www.reddit.com/r/ClaudeAI/comments/1ntq8tv/introducing_claude_usage_limit_meter/

Original Anthropic announcement here: https://www.reddit.com/r/ClaudeAI/comments/1mbo1sb/updating_rate_limits_for_claude_subscription/

UPDATE: Anthropic have posted an update here :

https://www.reddit.com/r/ClaudeAI/comments/1nvnafs/update_on_usage_limits/

Top answer

1 of 5

Just cancelled, this is bullshit, honestly.

2 of 5

It feels good to be in the 2% of affected users. With my $20 plan, I’m hitting 32% of my weekly limit in just 2-3 hours of coding. Great!

Apidog

apidog.com › blog › claude-pro-limits

What Are Claude Pro Limits and How to Bypass Them:

July 30, 2025 - While the maximum size of the context window (e.g., 200,000 tokens for Claude 3.7 Sonnet on Pro) is an architectural feature of the model version available on your plan, how much of that window you actively utilize in any given interaction directly ...

DataCamp

datacamp.com › blog › claude-3-7-sonnet

Claude 3.7 Sonnet: How it Works, Use Cases & More | DataCamp

February 25, 2025 - Free users can access Claude 3.7 Sonnet for basic tasks like writing, summarization, and general Q&A, but Thinking Mode is disabled. Claude Pro users (the $20/month paid plan) get full access to Thinking Mode, along with higher message limits and priority access during peak usage times.

Anthropic

docs.anthropic.com › en › docs › build-with-claude › extended-thinking

Building with extended thinking - Claude Docs