🌐
OpenRouter
openrouter.ai › pricing
Pricing - OpenRouter | OpenRouter
Transparent pricing for OpenRouter. Pay only for what you use with access to 500+ AI models. Free tier, Pay-as-you-go, and Enterprise plans available.
Chat
A router for LLMs and other AI models
Models
Browse models on OpenRouter
LLM Rankings
Language models ranked and analyzed by usage across apps
OpenRouter Quickstart Guide
Get started with OpenRouter's unified API for hundreds of AI models. Learn how to integrate using OpenAI SDK, direct API calls, or third-party frameworks.
🌐
OpenRouter
openrouter.ai › docs › faq
OpenRouter FAQ | Developer Documentation | OpenRouter | Documentation
There is usually a different price for prompt and completion tokens. There are also models that charge per request, for images and for reasoning tokens. All of these details will be visible on the models page. When you make a request to OpenRouter, we receive the total number of tokens processed by the provider. We then calculate the corresponding cost and deduct it from your credits.
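A minimal sketch of the calculation the FAQ describes, assuming prices are quoted per million tokens and that only prompt and completion tokens are billed. The function name and the prices below are hypothetical, not taken from OpenRouter; per-request, image, and reasoning-token charges are ignored.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)

def request_cost(prompt_tokens, completion_tokens,
                 prompt_price_per_m, completion_price_per_m):
    """Return the USD cost of one request.

    Prices are assumed to be USD per million tokens (hypothetical values);
    prompt and completion tokens are priced separately.
    """
    prompt_part = Decimal(prompt_tokens) * Decimal(str(prompt_price_per_m))
    completion_part = Decimal(completion_tokens) * Decimal(str(completion_price_per_m))
    return (prompt_part + completion_part) / TOKENS_PER_MILLION

# Hypothetical model: $0.50 per million prompt tokens, $1.50 per million completion tokens.
cost = request_cost(1200, 300, 0.50, 1.50)
print(f"${cost}")
```

Decimal is used here only so that very small per-request amounts do not pick up floating-point noise when summed over many requests.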
🌐
Inverted Stone
invertedstone.com › calculators › openrouter-pricing
OpenRouter Models Pricing Calculator | Compare Model Costs Instantly
Our OpenRouter pricing calculator helps you quickly estimate API costs across leading providers including Anthropic, OpenAI, xAI, Google, Perplexity, Mistral, and DeepSeek.
🌐
Reddit
reddit.com › r/chub_ai › open router cost per different ai models comparison
r/Chub_AI on Reddit: Open Router Cost per different AI Models Comparison
June 23, 2024 -

So I'm beginning to use the different AIs on Open Router.

The main ones I've been using are GPT4o, Claude Sonnet, Claude Haiku and Command R Plus.

-GPT4o and Command R Plus give great responses, but cost a lot.

-Claude Sonnet gives longer responses than Claude Haiku, but both work fairly well unless it's NSFW stuff (for which I use a JB) and don't cost as much. I tried Claude Opus, but man that was expensive as shit and I probably won't use that again.

So from what I've been seeing, the costs are these:

-GPT4o (0.015 cents)

-Command R Plus (0.015 cents)

-Claude Sonnet (0.010 cents)

-Claude Haiku (0.001 cents)

Does that seem around what anyone else spends for each response? Note, I keep my Min tokens at 300, with Max tokens at 350.

🌐
OpenRouter
openrouter.ai › compare › openai › chatgpt-4o-latest
ChatGPT-4o compared to other AI Models | OpenRouter
Compare AI models by key metrics like price, latency, throughput, context length, and other features.
🌐
GitHub
github.com › jonasnordlund › openrouter-calc
GitHub - jonasnordlund/openrouter-calc: A simple OpenRouter language model price calculator.
This is a simple price calculator for the language models provided by OpenRouter. It fetches the latest language model list along with pricing details and more, and allows you to specify your expected number of input tokens, output tokens, and ...
Author   jonasnordlund
🌐
Open WebUI
openwebui.com › f › kkoolldd › cost_tracker_openrouter
Cost Tracker - OpenRouter Function • Open WebUI Community
This function calculates API interaction pricing in Open WebUI and records user model usage for OpenRouter models. Based on maki's edit of bgeneto's cost tracker. See https://github.com/bgeneto/open-webui-cost-tracker for usage.
🌐
Reddit
reddit.com › r/xoulai › price / cost comparison of popular llm's from an openrouter/api perspective. thoughts on monthly fee's. (long)
r/XoulAI on Reddit: Price / Cost comparison of popular LLM's from an openrouter/API perspective. Thoughts on monthly fee's. (long)
July 19, 2025 -

Hello everyone,

Given the discussion of pricing, tiers and costs of these AI sites going around, I figured some hard numbers would be helpful so people can understand what we're looking at. It might give some perspective.

As someone who uses OpenRouter for API key access to LLMs, I pay money and choose what models I want to interact with; each chat in/out costs money based upon that specific model. We don't know what kind of pricing a chatbot site will get if they host their own model, rent servers, get bulk discounts, etc., but this gives us an understanding of the cost of these LLMs from a highly popular site.

TLDR:

Say you come home at night and chat a bit, have a 300-response chat, and do this daily.

Monthly cost - Personal - This is the price it COSTS ME to chat straight with the LLM, daily at 300 chat responses. I can do WAY more than this, though. It's why I don't have an issue paying $10 for a service (or more, for good models). I would pay that through the API if I wasn't giving it to a good site which provided me all kinds of other services along with access to the LLM.

DS V3 = $2.61/m

2.0 Flash = $1.49/m

Llama = $1.149/m

2.5 Pro = $28.98/m

Nemo = $0.0051/m


Tokens vs Characters

The first thing everyone has to understand is tokens vs characters.

Tokens are units of text (words, subwords, or punctuation) used by LLMs, where 1 token ≈ 4 characters or 0.75 words in English. Characters are individual letters, numbers, or symbols. Tokens group characters for processing.

Just remember: "hello" = 1 token and 5 characters.
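To make the rule of thumb concrete, here is a rough sketch that estimates a token count from character length using the 4-characters-per-token figure above. The function name is made up for illustration, and real tokenizers differ by model, so treat the result as a ballpark only.

```python
CHARS_PER_TOKEN = 4  # rough English average quoted above; varies by tokenizer

def estimate_tokens(text: str) -> int:
    """Estimate the token count of `text` from its character length."""
    if not text:
        return 0
    # Round to the nearest whole token, but never report fewer than one.
    return max(1, round(len(text) / CHARS_PER_TOKEN))

print(estimate_tokens("hello"))
print(estimate_tokens("x" * 1200))
```

Under this heuristic a 300-token response corresponds to roughly 1,200 characters of English text.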

These are the top 20 most popular LLMs being used from OpenRouter for roleplay.

Here is a chatroom where I dropped a query to these bots and got varied response lengths, in order to map costs to tokens in one easily organized view.


Let's take an average response length of 300 tokens from a chatbot during a conversation and look at some numbers. For this example we will ignore the cost IN, which is usually pretty minimal, and focus on the cost OUT.

Let's start out with Deepseek, 2.0 Flash, and Llama 3.1 (widely used for fine tunes). These are widely considered the benchmark for roleplay, as we can see by how popular they are, because they offer a very high standard of roleplay at a cheaper price.

Deepseek v3

275 tokens cost $0.000266; $0.000266 / 275 * 300 = $0.00029 each response; * 300 chat responses = $0.087 for a 300 chat session

Gemini 2.0 Flash

426 tokens cost $0.000236; $0.000236 / 426 * 300 = $0.00016 each response; * 300 chat responses = $0.049 for a 300 chat session

Llama 3.1

270 tokens cost $0.000115; $0.000115 / 270 * 300 = $0.00012 each response; * 300 chat responses = $0.038 for a 300 chat session


Now let’s look at your expensive options

Claude Sonnet 4

We won't bother with Claude because it's ridiculously expensive, just check the image. It's about 2x more expensive than Gemini 2.5 Pro.

Gemini 2.5 Pro

2068 tokens cost $0.0222; $0.0222 / 2068 * 300 = $0.0032 each response; * 300 chat responses = $0.966 for a 300 chat session


Now a cheap option. You typically see Mistral and Hermes as finetunes.

Mistral Nemo

218 tokens cost $0.000000417; $0.000000417 / 218 * 300 ≈ $0.00000057 each response; * 300 chat responses = $0.00017 for a 300 chat session

This is why you usually see Nemos, Hermes for free. They’re dirt cheap. But you know you’re chatting on something like this rather than Flash or Deepseek, unless they are very good finetunes.


Say you come home at night and chat a bit, have a 300-response chat, and do this daily.

Monthly cost - Personal - This is the price it COSTS ME to chat straight with the LLM, daily at 300 chat responses. I can do WAY more than this, though. It's why I don't have an issue paying $10 for a service (or more, for good models).

DS V3 = $2.61

2.0 Flash = $1.49

Llama = $1.149

2.5 Pro = $28.98

Nemo = $0.0051
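The session and monthly figures in this post can be reproduced with a short script. This is a sketch under the post's own assumptions: only output cost is counted, every response is 300 tokens, and a session is 300 responses. The 30-day month is an assumption inferred from the ratio between the session and monthly totals, and the function name is made up for illustration.

```python
DAYS_PER_MONTH = 30  # assumption: inferred from the post's monthly totals

def session_cost(sample_cost, sample_tokens, response_tokens=300, responses=300):
    """Scale one measured completion cost to a whole chat session.

    sample_cost / sample_tokens gives a per-token output price, which is
    then applied to `responses` replies of `response_tokens` tokens each.
    """
    cost_per_token = sample_cost / sample_tokens
    return cost_per_token * response_tokens * responses

# (measured cost in USD, measured output tokens), as quoted in the post
samples = {
    "Deepseek v3": (0.000266, 275),
    "Gemini 2.0 Flash": (0.000236, 426),
    "Llama 3.1": (0.000115, 270),
    "Gemini 2.5 Pro": (0.0222, 2068),
    "Mistral Nemo": (0.000000417, 218),
}

for name, (cost, tokens) in samples.items():
    per_session = session_cost(cost, tokens)
    per_month = per_session * DAYS_PER_MONTH
    print(f"{name}: ${per_session:.5f} per session, ${per_month:.4f} per month")
```

The results agree with the listed totals to within rounding, since the post rounds or truncates at each intermediate step.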


So. People want to bitch about spending $3-4 a month (or even $10/m) and expect this service for FREE! When we can clearly see that even if a site offers Nemo/Hermes or other base models for free, that is still money straight from their pocket. Let’s ignore the fact that most people want the better models for free.

I’ve heard Jupiter costs 50k/month.

Now, we need to understand that AI sites do NOT run their costs through a service like OpenRouter. They will typically pay for servers and host the LLM on those servers, doing their own fine tune.

Many AI chatbot sites rely on their subscribers to offset the cost of their free services. Sites like CS have a good Subscriber base so they can afford to offer some more bots for free, but that is still a negative for the site. Since Xoul had such a large freemium base to start, not as many people had reason to subscribe.

Anyways. If you made it this far... what's wrong with you? Ha! Love to all and I'm excited for Xoul to return. Hope I could provide SOME perspective.

🌐
GitHub
github.com › Aider-AI › aider › issues › 3055
Feature Request: Get OpenRouter pricing from API · Issue #3055 · Aider-AI/aider
January 29, 2025 - Hey, I have been switching to OpenRouter exclusively for a while now and I think it would be great if aider would pull the pricing info from the API when it uses an OpenRouter model. I know you can add configuration for model pricing loc...
Published   Jan 29, 2025
🌐
GitHub
github.com › orgs › langfuse › discussions › 3559
Token Cost using OpenRouter · langfuse · Discussion #3559
Langfuse calculates token costs based on model definitions that include pricing information. Since OpenRouter is not natively supported like OpenAI, you will need to define these models manually. ... Add Custom Model Definitions: Use the Langfuse Models API to add your custom model definitions for OpenRouter. This involves specifying attributes such as match_pattern, unit, and prices per unit (input, output, total).
🌐
OpenRouter
openrouter.ai
OpenRouter
Better prices, better uptime, no subscriptions. ... Access all major models through a single, unified interface. OpenAI SDK works out of the box. Browse all ... Reliable AI models via our distributed infrastructure. Fall back to other providers when one goes down. Learn more · Keep costs in check without sacrificing speed. OpenRouter ...
🌐
Hacker News
news.ycombinator.com › item
You can use something like OpenRouter, which lets you access essentially all com... | Hacker News
September 5, 2024 - You pay a different rate per model (OpenRouter shows the pricing transparently). You load your account with credits. I use it daily (undoubtedly far more than the average user) and loaded 50$ with credits five months ago, but I still have over 1/2 of it left · I think it is hard to believe ...
🌐
Inverted Stone
invertedstone.com › calculators › openai-pricing
OpenAI API Pricing Calculator | GPT-5.2 pro, GPT-5.2, GPT-5.1 & GPT-5
Enter your estimated token usage to calculate costs ... Access 300+ AI models via OpenRouter including OpenAI, Claude, Gemini, and more.
🌐
Pages
compare-openrouter-models.pages.dev
OpenRouter Model Price Comparison
Compare pricing across different AI models available on OpenRouter
🌐
Skywork
skywork.ai › home › openrouter review (2025): unified api access to 500+ ai models — pricing, performance, and pitfalls
OpenRouter Review 2025: Unified AI Model API, Pricing & Privacy
September 17, 2025 - Comprehensive 2025 review of OpenRouter: unified API, 500+ AI models, pricing, privacy, developer insights, and best alternatives for integration.
🌐
Medium
medium.com › @phsaurav › breaking-free-from-ai-subscriptions-cost-effective-all-in-one-solution-with-openrouter-a1f596ce1227
Breaking Free from AI Subscriptions: Cost-Effective All-in-One Solution with OpenRouter | by Parvez Hossain Saurav | Medium
March 15, 2025 - Perhaps the most appealing aspect is OpenRouter’s transparent pricing structure. Costs are clearly displayed for each model and prompt, with no additional fees beyond what you’d pay through official APIs.
🌐
OpenRouter
openrouter.ai › docs › use-cases › usage-accounting
Usage Accounting | Track AI Model Usage with OpenRouter | OpenRouter ...
Enabling usage accounting will add a few hundred milliseconds to the last response as the API calculates token counts and costs.
🌐
Skywork
skywork.ai › home › openrouter review (2025): a developer’s deep dive into the leading multi‑model llm api gateway
OpenRouter Review 2025: API Gateway, Latency & Pricing Compared
September 17, 2025 - In-depth 2025 review of OpenRouter’s LLM API gateway. Covers model access, latency, routing, pricing, privacy, and best-fit scenarios for developers.