They *seem* noticeably higher. I haven't hit them and I've been playing with it all day to see what it can do. I think I would've normally hit them a while ago - even with judicious use of MCP and resetting chats etc. Answer from CelebrationSecure510 on reddit.com
🌐
Reddit
reddit.com › r/claudeai › i love claude 3.7... just one problem: the rate limits make it unusable
r/ClaudeAI on Reddit: I love Claude 3.7... just one problem: The rate limits make it unusable
February 27, 2025 -

I've been experimenting with Claude 3.7 and I'm genuinely impressed with its capabilities. The quality of responses and reasoning is excellent, especially for coding tasks. However, as a free user, I'm finding it practically unusable due to the severe rate limits.

I can only get through about 1-2 coding prompts per day before hitting the limit. This makes it impossible to have any meaningful ongoing development session or troubleshooting conversation.

I would happily pay for a subscription if the context window was significantly larger. The current 8k token limit is simply too restrictive for serious work. For comparison, I regularly use Gemini 2.0 Pro which offers a 2 million token context window, allowing me to include entire codebases and documentation in my prompts. Look at grok and GPT-o3-mini, both models are comparable in terms of quality and i get many times the usage as a free user, grok 3 has 50 normal prompts a day and 10 thinking prompts a day, 03-mini gets unlimited 4o mini, tens of thousands of 4o tokens, and over a dozen 03 prompts, without paying a dime, all models having a much larger context window.

With just 8k tokens, I can barely fit a moderate-sized function and its related components before running out of space. Let along giving Claude frontend context. This means constantly having to reframe my questions and lose context, making complex programming tasks frustratingly inefficient.

Does anyone else feel the same way? I want to support Claude and would gladly pay for a better experience, but the current limitations make it hard to justify even for a paid tier.

🌐
Anthropic
anthropic.com › news › claude-3-7-sonnet
Claude 3.7 Sonnet and Claude Code
Second, when using Claude 3.7 Sonnet through the API, users can also control the budget for thinking: you can tell Claude to think for no more than N tokens, for any value of N up to its output limit of 128K tokens.
🌐
Simon Willison
simonwillison.net › 2025 › Feb › 25 › llm-anthropic-014
Claude 3.7 Sonnet, extended thinking and long output, llm-anthropic 0.14
February 25, 2025 - (This is the output limit—how much text it can produce in one go. Claude 3.7 Sonnet’s input limit remains 200,000—many modern models exceed 100,000 for input now.)
🌐
Reddit
reddit.com › r/claudeai › claude 3.7 output limit in ui
r/ClaudeAI on Reddit: Claude 3.7 output limit in UI
March 3, 2025 -

Since some people have been asking, here's the actual output limit for Sonnet 3.7 with and without thinking:
Non-thinking: 8192 tokens
Non-thinking chat: https://claude.ai/share/af0b52b3-efc3-452b-ad21-5e0f39676d9f

Thinking: 24196 tokens*
Thinking chat: https://claude.ai/share/c3c8cec3-2648-4ec4-a13d-c6cce7735a67

*The thinking tokens don't make a lot of sense to me, as I'd expect them to be 3 * 8192 = 24576, but close enough I guess. Also in the example the thinking tokens itself are 23575 before being cut off in the main response, so thinking alone may actually be longer.

Tokens have been calculated with the token counting API and subtracting 16 tokens (role and some other tokens that are always present).

Hope this helps and also thanks to the discord mod, that shall not be pinged, for the testing prompt.

🌐
16x Prompt
prompt.16x.engineer › blog › claude-daily-usage-limit-quota
What's Claude AI Daily Usage Limit Quota? (Free vs Pro) | 16x Prompt
This opens up possibilities for ... quite generous compared to the web interface, with Tier 1 users having a limit of 50 requests per minute to Claude 3.5 Sonnet....
🌐
Anthropic
docs.anthropic.com › en › docs › about-claude › models › extended-thinking-models
Building with extended thinking - Anthropic
With Claude 3.7 and 4 models, max_tokens (which includes your thinking budget when thinking is enabled) is enforced as a strict limit. The system will now return a validation error if prompt tokens + max_tokens exceeds the context window size.
Find elsewhere
🌐
Cursor
forum.cursor.com › discussions
Strong Discontent with Rate Limits on Claude 3.7 Sonnet for Slow Pool Users - Discussions - Cursor - Community Forum
December 21, 2024 - I would like to express my strong dissatisfaction with the current rate-limiting issues that are preventing slow pool users from accessing Claude 3.7 Sonnet. The model has immense potential, and it’s frustrating to see that users like myself, who are part of the slow pool, are unable to utilize ...
🌐
AWS
docs.aws.amazon.com › amazon bedrock › user guide › amazon bedrock foundation model information › inference request parameters and response fields for foundation models › anthropic claude models › *new* anthropic claude 3.7 sonnet
*NEW* Anthropic Claude 3.7 Sonnet - Amazon Bedrock
With Claude 3.7 Sonnet, max_tokens (which includes your thinking budget when thinking is enabled) is enforced as a strict limit. The system will now return a validation error if prompt tokens + max_tokens exceeds the context window size. When calculating context window usage with thinking enabled, ...
🌐
Reddit
reddit.com › r/claudeai › update on usage limits
r/ClaudeAI on Reddit: Update on Usage Limits
October 1, 2025 -

We've just reset weekly limits for all Claude users on paid plans.

We've seen members of this community hitting their weekly usage limits more quickly than they might have expected. This is driven by usage of Opus 4.1, which can cause you to hit the limits much faster than Sonnet 4.5.

To help during this transition, we've reset weekly limits for all paid Claude users.

Our latest model, Sonnet 4.5 is now our best coding model and comes with much higher limits than Opus 4.1. We recommend switching your usage over from Opus, if you want more usage. You will also get even better performance from Sonnet 4.5 by turning on "extended thinking" mode. In Claude Code, just use the tab key to toggle this mode on.

We appreciate that some of you have a strong affinity for our Opus models (we do too!). So we've added the ability to purchase extra usage if you're subscribed to the Max 20x plan. We’ll put together more guidance on choosing between our models in the coming weeks.

We value this community’s feedback. Please keep it coming – we want our models and products to work well for you.

🌐
GitHub
github.com › anthropics › claude-code › issues › 15064
Claude AI usage limit reached and won't reset|1766502000 · Issue #15064 · anthropics/claude-code
3 days ago - Please try again later.\"},\"request_id\":\"req_011CWMy4AuP1JEoDD2ZvDnU6\"}\n at new M2 (unknown:1:28)\n at new hB (B:/~BUN/root/claude.exe:1150:5968)\n at new W00 (unknown:1:28)\n at generate (B:/~BUN/root/claude.exe:1150:6630)\n at makeRequest (B:/~BUN/root/claude.exe:1167:5327)\n at processTicksAndRejections (native:7:39)","timestamp":"2025-12-22T15:32:14.870Z"},{"error":"Error: 429 {\"type\":\"error\",\"error\":{\"type\":\"rate_limit_error\",\"message\":\"This request would exceed your account's rate limit.
Published   Dec 22, 2025
🌐
Substack
simonw.substack.com › p › claude-37-sonnet-extended-thinking
Claude 3.7 Sonnet, extended thinking and long output
February 25, 2025 - (This is the output limit - how much text it can produce in one go. Claude 3.7 Sonnet's input limit remains 200,000 - many modern models exceed 100,000 for input now.)
🌐
TypingMind
blog.typingmind.com › home › claude rate exceeded: guide to fix and prevent the error
Claude Rate Exceeded: Guide to Fix and Prevent the Error
October 9, 2025 - If you encounter the “Rate Exceeded” error when using Claude, here are the most effective ways to resolve it immediately: Shorten your input prompt or reduce the maximum output tokens. Longer prompts and responses consume more tokens and can quickly exceed your per-minute token limit. If you’re making multiple calls to the API or through an automation workflow, add short delays between requests. Sending requests too quickly can trigger rate limits even when total usage is within quota.
🌐
Reddit
reddit.com › r/claudeai › claude is giving us a 2x usage limit until december 31st. thank you, thank you.
r/ClaudeAI on Reddit: Claude is giving us a 2X usage limit until December 31st. Thank you, thank you.
1 day ago - Had I known this in advance I would have used up what I had. (Also right after the reset I accidently typed "quit;" instead of quit into Claud Code and it ate up 9% of my 5 hour quota.
🌐
Reddit
reddit.com › r/claudeai › usage limits discussion megathread - beginning sep 30, 2025
r/ClaudeAI on Reddit: Usage Limits Discussion Megathread - beginning Sep 30, 2025
October 1, 2025 -

This Megathread is to discuss your thoughts, concerns and suggestions about the changes involving the Weekly Usage Limits implemented alongside the recent Claude 4.5 release. Please help us keep all your feedback in one place so we can prepare a report for Anthropic's consideration about readers' suggestions, complaints and feedback. This also helps us to free the feed for other discussion. For discussion about recent Claude performance and bug reports, please use the Weekly Performance Megathread instead.

Please try to be as constructive as possible and include as much evidence as possible. Be sure to include what plan you are on. Feel free to link out to images.

Recent related Anthropic announcement : https://www.reddit.com/r/ClaudeAI/comments/1ntq8tv/introducing_claude_usage_limit_meter/

Original Anthropic announcement here: https://www.reddit.com/r/ClaudeAI/comments/1mbo1sb/updating_rate_limits_for_claude_subscription/


UPDATE: Anthropic have posted an update here :

https://www.reddit.com/r/ClaudeAI/comments/1nvnafs/update_on_usage_limits/

🌐
Apidog
apidog.com › blog › claude-pro-limits
What Are Claude Pro Limits and How to Bypass Them:
July 30, 2025 - While the maximum size of the context window (e.g., 200,000 tokens for Claude 3.7 Sonnet on Pro) is an architectural feature of the model version available on your plan, how much of that window you actively utilize in any given interaction directly ...
🌐
DataCamp
datacamp.com › blog › claude-3-7-sonnet
Claude 3.7 Sonnet: How it Works, Use Cases & More | DataCamp
February 25, 2025 - Free users can access Claude 3.7 Sonnet for basic tasks like writing, summarization, and general Q&A, but Thinking Mode is disabled. Claude Pro users (the $20/month paid plan) get full access to Thinking Mode, along with higher message limits and priority access during peak usage times.
🌐
Anthropic
docs.anthropic.com › en › docs › build-with-claude › extended-thinking
Building with extended thinking - Claude Docs
With Claude 3.7 and 4 models, max_tokens (which includes your thinking budget when thinking is enabled) is enforced as a strict limit. The system will now return a validation error if prompt tokens + max_tokens exceeds the context window size.