As long as Anthropic offers better solutions for programming than the rest, they can charge whatever they want. I'll never understand the people who claim that they are leaving Claude for other services. They just can't be building anything complicated. I've tried everything out there multiple times, and nothing comes close to Claude.

Answer from WeeklySoup4065 on reddit.com
🌐
Reddit
reddit.com › r/claudeai › anthropic's api pricing... can they stay competitive?
r/ClaudeAI on Reddit: Anthropic's API Pricing... Can They Stay Competitive?
February 5, 2025 -

Question in the title... With Gemini already 90% cheaper than Anthropic, and now DeepSeek following suit, how will users be able to justify the Claude API price?

I built out all my AI features of my software to use Anthropic, but now given the context size I'm producing and what I expect users to produce in terms of context size, it becomes harder and harder for me to justify the Anthropic price.

Have they released any news recently about a breakthrough that could make Sonnet more affordable?

🌐
Reddit
reddit.com › r/claudeai › anthropic api pricing
r/ClaudeAI on Reddit: Anthropic API pricing
November 16, 2024 -

Hey everyone,

I'm using the Anthropic API with the Continue extension in VS Code while working on a Fast-Food Store Management project. My stack includes MongoDB for the backend and React, Express, and JavaScript for the frontend. This is just a personal project, I’m not planning to release it, just exploring the capabilities of the wonderful Sonnet 3.7.

I’ve noticed that as my project grows, the cost per prompt keeps increasing. From what I understand, this happens because the model needs to process a larger amount of data each time. At this point, I estimate that each prompt is costing around $3.

Does anyone know exactly how pricing is determined? Is it purely based on token usage (input + output), or are there other factors? Also, is there a more cost-effective way to handle larger projects while still leveraging the model effectively?

P.S. Not complaining at all, just curious about what to expect moving forward. Feeling pretty lucky to have access to these kinds of tools these days!
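To make the pricing question above concrete, here is a minimal sketch of how per-request cost is usually computed from input and output token counts. The $3/$15 per MTok rates are the Sonnet-class figures quoted elsewhere on this page and are an assumption; check the official pricing page for the model you actually use, and note that agent/tool workflows can multiply the token counts well beyond a single call.

```python
# A minimal sketch: per-request cost is roughly
# (input_tokens * input_rate) + (output_tokens * output_rate).
# Rates below are assumed Sonnet-class prices ($3 / $15 per million tokens).

def estimate_request_cost(input_tokens: int, output_tokens: int,
                          input_per_mtok: float = 3.00,
                          output_per_mtok: float = 15.00) -> float:
    """Rough cost in dollars for a single API call."""
    return (input_tokens * input_per_mtok + output_tokens * output_per_mtok) / 1_000_000

# As a project grows, more context gets sent as input each time,
# so the input term quickly dominates:
print(estimate_request_cost(20_000, 2_000))    # small project:  ~$0.09
print(estimate_request_cost(180_000, 2_000))   # large context:  ~$0.57
```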

🌐
Finout
finout.io › blog › anthropic-api-pricing
Anthropic API Pricing: Complete Guide and Cost Optimization Strategies (2025)
September 1, 2025 - Discover Anthropic API pricing in 2025. See Claude model rates (Haiku, Sonnet, Opus), subscription tiers, usage limits, and best practices to cut token costs and optimize spend at scale.
🌐
Reddit
reddit.com › r/thinkingdeeplyai › i analyzed the ai api price war between open ai, google and anthropic. here’s the brutal truth for devs and founders. it's the golden age of cheap ai
r/ThinkingDeeplyAI on Reddit: I analyzed the AI API Price War between Open AI, Google and Anthropic. Here’s the brutal truth for devs and founders. It's the Golden Age of Cheap AI
June 15, 2025 -

I just went down a rabbit hole analyzing the 2025 AI API landscape, comparing the complicated API costs of OpenAI, Google, and Anthropic. The competition is absolutely brutal, prices are really low right now, and capabilities are exploding!

I’ve crunched the numbers and summarized the key takeaways for everyone from indie hackers to enterprise architects. I’m attaching some of the key charts from the analysis to this post.

TL;DR: The 3 Big Takeaways

  • AI is stupidly cheap right now. For most apps, the API cost is a rounding error. Google in particular is destroying the competition on price. If you’ve been waiting to build, stop. This might be the cheapest AI will ever be.

  • There is NO single “best” provider. Anyone telling you "just use X" is wrong. The "best" model depends entirely on the specific task. The winner for summarizing a document is different from the winner for powering a chatbot.

  • The smartest strategy is a "Multi-Model World." The best companies are building a routing layer that picks the most cost-effective model for each specific API call. Vendor lock-in is the enemy.

Have a read through the 12 infographics attached that give some great metric comparisons across the providers

Part 1: The Three Tiers of AI: Brains, All-Rounders, and Sprinters

The market has clearly split into three categories. Knowing them is the first step to not overpaying.

  1. The Flagship Intelligence (The "Brain"): This is Anthropic's Claude 4 Opus, OpenAI's GPT-4o, and Google's Gemini 2.5 Pro. They are the most powerful, best at complex reasoning, and most expensive. Use them when quality is non-negotiable.

  2. The Balanced Workhorses (The "All-Rounder"): This is the market's sweet spot. Models like Anthropic's Claude 4 Sonnet, OpenAI's GPT-4o, and Google's Gemini 1.5 Pro offer near-flagship performance at a much lower cost. This is your default tier for most serious business apps.

  3. The Speed & Cost-Optimized (The "Sprinter"): These models are ridiculously fast and cheap. Think Anthropic's Claude 3.5 Haiku, OpenAI's GPT-4o mini, and Google's Gemini 1.5 Flash. They're perfect for high-volume, simple tasks where per-transaction cost is everything.

Part 2: The Price Isn't the Whole Story (TCO is King)

One of the biggest mistakes is picking the API with the lowest price per token. The real cost is your Total Cost of Ownership (TCO).

Consider a content marketing agency generating 150 blog posts a month.

  • Strategy A (Cheaper API): Use a workhorse model like GPT-4o. The API bill is low, maybe ~$50. But if the output is 7/10 quality, a human editor might spend 4 hours per article fixing it. At $50/hr, that's $30,000 in labor.

  • Strategy B (Premium API): Use a flagship model like Claude 4 Opus, known for high-quality writing. The API bill is higher, maybe ~$250. But if the output is 9/10 quality and only needs 2 hours of editing, the labor cost drops to $15,000.

Result: Paying 5x more for the API saved the company nearly $15,000 in total workflow cost. Don't be penny-wise and pound-foolish. Match the model quality to your workflow's downstream costs.
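A quick sketch reproducing the post's TCO arithmetic; all of the inputs (post count, editing hours, $50/hr editor rate, rough API bills) are the post's own illustrative assumptions, not measured data.

```python
# Total cost of ownership = API bill + downstream editing labor.
def total_cost(api_bill: float, posts: int, edit_hours_per_post: float,
               editor_rate: float = 50.0) -> float:
    return api_bill + posts * edit_hours_per_post * editor_rate

strategy_a = total_cost(api_bill=50, posts=150, edit_hours_per_post=4)   # ~$30,050
strategy_b = total_cost(api_bill=250, posts=150, edit_hours_per_post=2)  # ~$15,250
print(strategy_a - strategy_b)  # ~$14,800 saved despite the pricier API
```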

Part 3: The Great Context Window Debate: RAG vs. "Prompt Stuffing"

This is a huge one for anyone working with large documents. The context window sizes alone tell a story: Google Gemini: up to 2M tokens, Anthropic Claude: 200K tokens, OpenAI GPT-4: 128K tokens.

  • The Old Way (RAG - Retrieval-Augmented Generation): You pre-process a huge document, break it into chunks, and store it in a vector database. When a user asks a question, you find the most relevant chunks and feed just those to the model.

    • Pro: Very cheap per query, fast responses.

    • Con: Complex to build and maintain. A big upfront investment in developer time.

  • The New Way (Long-Context / "Prompt Stuffing"): With models like Google's Gemini, you can just stuff the entire document (or book, or codebase) into the prompt and ask your question.

    • Pro: Incredibly simple to develop. Go from idea to production way faster.

    • Con: Can be slower and MUCH more expensive per query.

The trade-off is clear: Developer time (CapEx) vs. API bills (OpEx). The reports show for an enterprise research assistant querying a 1,000-page document 1,000 times a month, the cost difference is staggering: RAG is ~$28/month vs. the naive Long-Context approach at ~$1,680/month.
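A back-of-the-envelope check on that comparison. The tokens-per-page figure, the ~$2.50/MTok long-context input rate, and the ~10K tokens retrieved per RAG query are all assumptions chosen to land near the report's numbers, not figures from the report itself.

```python
# Rough monthly input cost: 1,000 queries against a 1,000-page document.
DOC_PAGES, TOKENS_PER_PAGE, QUERIES = 1_000, 650, 1_000   # assumed ~650 tokens/page

def monthly_input_cost(tokens_per_query: int, rate_per_mtok: float) -> float:
    return tokens_per_query * QUERIES * rate_per_mtok / 1_000_000

prompt_stuffing = monthly_input_cost(DOC_PAGES * TOKENS_PER_PAGE, 2.50)  # ~$1,625
rag = monthly_input_cost(10_000, 2.50)                                   # ~$25
print(prompt_stuffing, rag)  # same ballpark as the ~$1,680 vs ~$28 figures above
```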

Part 4: Who Wins for YOUR Use Case?

Let's get practical.

  • For the Hobbyist / Indie Hacker: Cost is everything. Start with Google's free tier for Gemini. If you need to pay, OpenAI's GPT-4o mini or Google's Gemini 1.5 Flash will cost you literal pennies a month.

  • For the Small Business (e.g., Customer Service Chatbot): This is the "workhorse" battleground. For a chatbot handling 5,000 conversations a month, the cost difference is stark:

    • Google Gemini 1.5 Pro: ~$38/month

    • Anthropic Claude 4 Sonnet: ~$105/month

    • OpenAI GPT-4o: ~$125/month

    • Verdict: Google is the aggressive price leader here, offering immense value.

  • For the Enterprise: It's all about architecture. For frequent tasks, a RAG system with a cheap, fast model is the most cost-effective. For one-off deep analysis of massive datasets, the development-time savings from Google Gemini's huge context window is the key selling point.

Part 5: Beyond Text - The Multimodal Battleground

  • Images: It's a tight race. Google's Imagen 3 is cheapest for pure generation at a flat $0.03 per image. OpenAI's DALL-E/GPT-Image offers more quality tiers ($0.01 to $0.17), giving you control. Both are excellent for image analysis. Anthropic isn't in this race yet.

  • Audio: OpenAI's Whisper remains a go-to for affordable, high-quality transcription (~$0.006/minute). Google has a robust, competitively priced, and deeply integrated audio API for speech-to-text and text-to-speech.

  • Video: Google is the undisputed leader here. They are the only one with a publicly priced video generation model (Veo 2 at $0.35/second) and native video analysis in the Gemini API. If your app touches video, you're looking at Google.

Controversial Take: Is Claude Overpriced?

Let's be blunt. Claude Opus 4 costs $75.00 per million output tokens. GPT-4o costs $15.00. Gemini 2.0 Flash costs $0.40. That means Claude's flagship is 5x more expensive than OpenAI's and over 180x more expensive than Google's fast model.

Yes, Claude is excellent for some long-form writing and safety-critical tasks. But is it 5x to 180x better? For most use cases, the answer is a hard no. It feels like luxury car pricing for a slightly better engine, and for many, it's a premium trap.

Final Thoughts: The Golden Age of Cheap AI

Google is playing chess while others play checkers. They are weaponizing price to gain market share, and it's working. They offer the cheapest pricing, the largest context windows, and full multimodal support.

This is likely the cheapest AI will ever be. We're in the "growth at all costs" phase of the market. Once adoption plateaus, expect prices to rise. The single best thing you can do is build a simple abstraction layer in your app so you can swap models easily.
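As a rough illustration of that abstraction layer, here is a minimal routing sketch. The task categories, model names, and the idea of passing provider clients in as plain callables are illustrative choices, not the post's prescription.

```python
# A minimal "routing layer": pick a model per task type so swapping
# providers is a one-line config change rather than a rewrite.
from dataclasses import dataclass
from typing import Callable

@dataclass
class ModelChoice:
    provider: str
    model: str

ROUTES: dict[str, ModelChoice] = {
    "summarize": ModelChoice("google", "gemini-flash"),      # cheap, high volume
    "chat":      ModelChoice("openai", "gpt-4o-mini"),
    "write":     ModelChoice("anthropic", "claude-sonnet"),  # quality-sensitive
}

def complete(task: str, prompt: str,
             clients: dict[str, Callable[[str, str], str]]) -> str:
    """Dispatch to whichever provider client handles this task type."""
    choice = ROUTES.get(task, ROUTES["chat"])
    return clients[choice.provider](choice.model, prompt)
```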

The future isn't about one AI to rule them all. It's about using the right tool for the right job.

Now, go build something amazing while it's this cheap.

What are your go-to models? Have you found any clever cost-saving tricks?

🌐
Reddit
reddit.com › r/chatgptcoding › roocode + anthropic key is really expensive!
r/ChatGPTCoding on Reddit: Roocode + Anthropic Key is really expensive!
March 1, 2025 -

I’m new to this whole AI IDE thing, and I’m currently using Roo with my own Anthropic API key. So far, it’s really expensive; sometimes a single prompt costs me up to $0.40 with Claude Sonnet 3.7. Now I’m considering other options, but I don’t know which one to choose.

Does anyone have any idea which alternative would be the most cost-effective, especially for large projects?

🌐
Reddit
reddit.com › r/claudeai › understanding pricing
r/ClaudeAI on Reddit: Understanding pricing
July 17, 2024 -

Hi, I'm new to Claude (and LLMs in general) and I don't quite understand the pricing. My goal is to use Claude paired with the aider tool.

For example, Claude's Haiku model costs: $0.25 / MTok (Input) $1.25 / MTok (Output) (https://www.anthropic.com/pricing#anthropic-api)

Does this mean that I get charged $0.25 for every million tokens I send (a million words, I assume?) and $1.25 for every million tokens I receive? That seems like a lot of tokens for very little money. Is this a monthly charge? Am I overestimating the amount of data that I can send/receive?

Top answer
1 of 5
8
Current Anthropic pricing:

Model Name | Release Date | Input Price (m/tok) | Output Price (m/tok)
Haiku 3 | 2024-03-07 | $0.25 | $1.25
Sonnet 3/3.5 | 2024-02-29 | $3.00 | $15.00
Opus 3 | 2024-02-29 | $15.00 | $75.00

You are correct, but bear in mind that during a chat, OUTPUT tokens become INPUT tokens at the next turn (you send your previous OUTPUT as the INPUT). Here is an estimate of words to tokens:

Text Type | Words | Tokens | Ratio
Simple English | 100 | 75 | 1.33:1
Technical Documentation | 100 | 85 | 1.18:1
Twitter-style with hashtags | 100 | 110 | 0.91:1
Legal Jargon | 100 | 80 | 1.25:1

You can limit OUTPUT tokens at the API level for cost control if you need. Yes, Opus is 60 times the cost of Haiku.
2 of 5
3
Since the docs are rather clear about not being intended for individual use ("Please note that access to the API is subject to our Commercial Terms of Service and is not intended for individual use."), I'll try to describe it in more general terms; it works similarly with OpenRouter, for example.

First you have to buy some prepaid "Credits"; it's basically like a prepaid debit card that can only be used for making LLM calls. You get billed for your usage. So, for example, with Haiku's input tokens the formula is token_count * ($0.25 / 1'000'000). That means 100'000 input tokens in a single call cost $0.025, so 2.5 cents. These get deducted from your total credits when you make that call. You can always top up your credits (for example, with OpenRouter, at https://openrouter.ai/credits).

Also, take a look at the difference between tokens and words here: Tokens

I think you understood the rest of how the pricing worked, right?
🌐
Reddit
reddit.com › r/singularity › openai / anthropic / google are pricing their models to still run at enormous annual losses. what's their endgame here?
r/singularity on Reddit: OpenAI / Anthropic / Google are pricing their models to still run at enormous annual losses. What's their endgame here?
November 22, 2023 -

These LLMs cost millions per day to run, and even with super popular paid API services, the revenue is nowhere near enough to cover mind-boggling costs. What happens next?

The big industry AI leaders raise prices to cover OpEx, and companies realize a low-paid human is just less of a headache compared to an LLM?

Do they just keep absorbing billions in losses like Uber did until cab companies were destroyed, then enjoy no competition?

Are they holding out until a model is capable of displacing enough people that it actually is a good value for business customers?

🌐
Reddit
reddit.com › r/anthropic › should i get the $20 for the consumer pro version, or pay as you go with the api version?
r/Anthropic on Reddit: Should I get the $20 for the consumer Pro version, or pay as you go with the API version?
December 26, 2023 -

I'm new to Claude and I use AI for personal daily tasks, maybe 20-30 times per day when I use it. I don't want to pay $20 a month because I don't want to get into yet another recurring subscription.

I got the free $5 credit for the API version to try Opus and I'd be happy using it via that interface.

Would it make more sense for me to just add credit to the plan of the API version when I need to?

🌐
Reddit
reddit.com › r/anthropic › api calls and rate limits
r/Anthropic on Reddit: API calls and rate limits
December 27, 2024 -

Hi All, I am considering publishing an app that I use all the time. Other people have seen it and have expressed an interest in using it. The problem is that when it runs even for myself, it hits api call rate limits. I am currently on a good rate limit of 160000 input tokens per minute. But, how could I somehow ensure that other people could use it too without us all congesting the interface please? How do businesses usually deal with this? I am happy to pay for the extra calls but could do without these rate limits. Thank you.
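One common way to let several users share a per-minute token budget is a simple client-side throttle plus retry with exponential backoff when the API rejects a request. The sketch below is a generic, simplified pattern, not Anthropic-specific code: the 160,000 tokens-per-minute figure comes from the post, and `call_model` stands in for whatever API client is actually used.

```python
# Simplified client-side throttle for a shared tokens-per-minute budget,
# with exponential backoff on failures (e.g. HTTP 429 rate-limit errors).
import time, random

TPM_LIMIT = 160_000
window_start, tokens_used = time.monotonic(), 0

def throttled_call(call_model, prompt: str, est_tokens: int, max_retries: int = 5):
    global window_start, tokens_used
    for attempt in range(max_retries):
        now = time.monotonic()
        if now - window_start >= 60:                      # start a fresh minute window
            window_start, tokens_used = now, 0
        if tokens_used + est_tokens > TPM_LIMIT:          # budget would be exceeded
            time.sleep(60 - (now - window_start))         # wait for the window to roll over
            continue
        try:
            tokens_used += est_tokens
            return call_model(prompt)
        except Exception:                                 # e.g. a 429 from the API
            time.sleep(2 ** attempt + random.random())    # exponential backoff with jitter
    raise RuntimeError("rate limit retries exhausted")
```

In practice a shared queue (or a gateway service in front of the API key) does the same job more robustly, but the idea is identical: smooth bursts so concurrent users don't all hit the limit at once.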

🌐
Reddit
reddit.com › r/claudeai › questions on anthropic enterprise pricing
r/ClaudeAI on Reddit: Questions on Anthropic Enterprise Pricing
February 2, 2025 -

Hello all,

We (i.e. my company) are trying to figure out if we should get on the enterprise plan for Anthropic's API, primarily to avoid getting rate limited for our internal products. I was wondering if anyone here has already talked to their sales team and knows what their pricing model for enterprises is.

It would also be helpful to hear whether they offer other splits or models within their enterprise plan as well. Thanks in advance! Feel free to DM me if you'd prefer.

🌐
Reddit
reddit.com › r/anthropic › is claude api cheaper than chat for heavy usage? need project/knowledge base management
r/Anthropic on Reddit: Is Claude API cheaper than chat for heavy usage? Need project/knowledge base management
August 26, 2024 -

I'm currently using Claude 3.5 Sonnet through the chat interface with a limit of 45 messages per 5 hours. My project is consuming over 50% of the available tokens (according to the project information shown in the UI). I have two main questions:

  1. Is using the Claude API potentially cheaper for this level of usage compared to the chat interface? And is there an option for adding files with information about the project?

  2. Are there any tools or services that are wrappers over the API and offer specifically the ability to maintain a knowledge base of files for my project?

If anyone has experience with similar usage patterns or knows of suitable project management tools for use with Claude, I'd appreciate your insights.

🌐
Reddit
reddit.com › r/claudeai › can anthropic keep up with those pricing ?
r/ClaudeAI on Reddit: Can Anthropic keep up with those pricing ?
March 16, 2025 - The Gemini API project is using my billing account; I can see the usage, but it's only free usage, no money is being spent and I hit the quotas... I am unable to find an answer anywhere. ... I mean I'm willing to pay because, as always, regardless of the benchmarks Sonnet has a magic to it that other models just don't. ... Anthropic shows no sign of wanting to compete on price...
🌐
Reddit
reddit.com › r/anthropic › claude opus 4 just cost me $7.60 for one task on windsurf.
r/Anthropic on Reddit: Claude Opus 4 just cost me $7.60 for ONE task on Windsurf.
May 23, 2025 -

Yesterday Anthropic dropped Claude Opus 4. As a Claude fanboy, I was pumped.

Windsurf immediately added support. Perfect timing.

So, I asked it to build a complex feature. Result: Absolutely perfect. One shot. No back-and-forth. No debugging.

Then I checked my usage: $7.31 for one task. One feature request.

The math just hit me: Windsurf makes you use your own API key (BYOK). Smart move on their part.

  • They charge: $15/month for the tool

  • I paid: $7.31 per Opus 4 task directly to Anthropic

  • Total cost: $15 + whatever I burn through

If I do 10 tasks a day, that’s $76 daily. Plus the $15 monthly fee.

$2300/month just to use Windsurf with Opus 4.
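For reference, the monthly figure follows directly from the per-task cost. The post quotes $7.31 in the body and $7.60 in the title; the $7.60 figure is what lines up with the "$76 daily" estimate.

```python
# The post's own arithmetic, spelled out (per-task cost as reported in the title):
cost_per_task, tasks_per_day, days, tool_fee = 7.60, 10, 30, 15
monthly = cost_per_task * tasks_per_day * days + tool_fee
print(monthly)  # ~$2,295, i.e. the "~$2,300/month" figure above
```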

No wonder they switched to BYOK. They’d be bankrupt otherwise.

The quality is undeniable. But price per task adds up fast.

Either AI pricing drops, or coding with top-tier AI becomes a luxury only big companies can afford.

Are you cool with $2000+/month dev tool costs? Or is this the end of affordable AI coding assistance?

🌐
Reddit
reddit.com › r/claudeai › anthropic launch batch pricing
r/ClaudeAI on Reddit: Anthropic launch Batch Pricing
October 8, 2024 -

Anthropic have launched message batching, offering a 50% discount on input/output tokens as long as you can wait up to 24 hours for the results. This is great news.

Alex Albert Twitter Thread

Anthropic API Page

Pricing out a couple of scenarios for Sonnet 3.5 looks like this (10,000 runs of each scenario):

Scenario | Normal | Cached | Batch
Summarisation | $855.00 | $760.51 | $427.50
Knowledge Base | $936.00 | $126.10 | $468.00

What now stands out is that for certain tasks, you might still be better off using the real-time caching API rather than batching.

Since using the Caching and Batch interfaces requires different client behaviour, it's a little frustrating that we now have 4 input token prices to consider. Wonder why Batching can't take advantage of Caching pricing...?

Scenario Assumptions (Tokens): Summarisation - 3,500 System Prompt. 15,000 Document Length. 2,000 Output. Knowledge Base - 30,000 System Prompt/KB. 200 Question Length. 200 Output.

Pricing (Sonnet 3.5):

Type | Price (m/tok)
Input - Cache Read | $0.30
Input - Batch | $1.50
Input - Normal | $3.00
Input - Cache Write | $3.75
Output - Batch | $7.50
Output - Normal | $15.00
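The scenario table above can be reproduced from these per-token prices and the stated token assumptions. A quick sketch (the small gap versus the $760.51 and $126.10 figures is the one-off cache-write cost, which this sketch omits):

```python
# Reproducing the scenario figures from the listed prices; 10,000 runs each.
RUNS = 10_000
PRICE = {"in": 3.00, "in_batch": 1.50, "cache_read": 0.30,
         "cache_write": 3.75, "out": 15.00, "out_batch": 7.50}  # $/MTok

def cost(tokens: int, per_mtok: float, runs: int = RUNS) -> float:
    return tokens * runs * per_mtok / 1_000_000

# Summarisation: 3,500 system prompt + 15,000 document in, 2,000 out.
normal = cost(18_500, PRICE["in"]) + cost(2_000, PRICE["out"])               # $855.00
cached = (cost(3_500, PRICE["cache_read"]) + cost(15_000, PRICE["in"])
          + cost(2_000, PRICE["out"]))                                        # ~$760.50
batch  = cost(18_500, PRICE["in_batch"]) + cost(2_000, PRICE["out_batch"])    # $427.50

# Knowledge base: 30,000 cached KB + 200 question in, 200 out.
kb_cached = (cost(30_000, PRICE["cache_read"]) + cost(200, PRICE["in"])
             + cost(200, PRICE["out"]))                                       # ~$126.00
print(normal, cached, batch, kb_cached)
```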
🌐
Anthropic
anthropic.com › pricing
Pricing | Claude
Explore Claude pricing plans and API costs. Choose from Free, Pro, Max, Team, or Enterprise options to get started with Claude.
🌐
Reddit
reddit.com › r/claudeai › [ removed by moderator ]
r/ClaudeAI on Reddit: [ Removed by moderator ]
May 28, 2024 - Maybe $5 maximum per month on the API side of things. ... And now that we also have caching support in the Anthropic API, it will probably be even less moving forward. Basically the Claude Project is a cache of files, which is now supported also ...
🌐
Reddit
reddit.com › r/claudeai › anthropic is definitely losing money on pro subscriptions, right?
r/ClaudeAI on Reddit: Anthropic is definitely losing money on Pro subscriptions, right?
July 31, 2024 -

Well, at least for the power users who run into usage limits regularly–which seems to pretty much be everyone. I'm working on an iterative project right now that requires 3.5 Sonnet to churn out ~20000 tokens of code for each attempt at a new iteration. This has to get split up across several responses, with each one getting cut off at around 3100-3300 output tokens. This means that when the context window is approaching 200k, which is pretty often, my requests would be costing me ~$0.65 each if I had done them through the API. I can probably get in about 15 of these high token-count prompts before running into usage limits, and most days I'm able to run out my limit twice, but sometimes three times if my messages replenish at a convenient hour.

So being conservative, let's say 30 prompts * $0.65 = $19.50... which means my usage in just a single day might've cost me nearly as much via API as I'd spent for the entire month of Claude Pro. Of course, not every prompt will be near the 200k context limit so the figure may be a bit exaggerated, and we don't know how much the API costs Anthropic to run, but it's clear to me that Pro users are being showered with what seems like an economically implausible amount of (potential) value for $20. I can't even imagine how much it was costing them back when Opus was the big dog. Bizarrely, the usage limits actually felt much higher back then somehow. So how in the hell are they affording this, and how long can they keep it up, especially while also allowing 3.5 Sonnet usage to free users now too? There's a part of me that gets this sinking feeling knowing the honeymoon phase with these AI companies has to end and no tech startup escapes the scourge of Netflix-ification, where after capturing the market they transform from the friendly neighborhood tech bros with all the freebies into kafkaesque rentier bullies, demanding more and more while only ever seeming to provide less and less in return, keeping us in constant fear of the next shakedown, etc etc... but hey at least Anthropic is painting itself as the not-so-evil techbro alternative so that's a plus. Is this just going to last until the sweet VC nectar dries up? Or could it be that the API is what's really overpriced, and the volume they get from enterprise clients brings in a big enough margin to subsidize the Pro subscriptions–in which case, the whole claude.ai website would basically just be functioning as an advertisement/demo of sorts to reel in API clients and stay relevant with the public? Any thoughts?