We have been using millions of token but the dashboard bill has been 0 in the past couple days.
As everyone know, Copilot limit the premium models to 300 request/month. Google on the other hand is looking to expend their business and are now offering 100 request per day for free of Gemini 2.5 Pro on their API. It's back on the free tier!
-
100 RPD (Request Per Day) for Gemini 2.5 Pro
-
250 RPD for Gemini 2.5 Flash. (It was 500 until now, so they lowered that, but that model is mostly 4.1 intelligence level, maybe slightly above or below depending on the language. And 250 per day is already more than enough.)
https://ai.google.dev/gemini-api/docs/rate-limits
Update: Now they updated the model in the list, just go to the Manage Models... in the model selector and click Gemini, enter your API key from Google AI Studio, select both Pro and Flash and you are good to go.
Make sure you use Gemini 2.5 Pro at the bottom of the list, if not, it count toward your 300 requests.
The model is much smarter than the free 4.1. So that's ~3000 request per month + your 300 premium that you can keep for Sonnet 4. I hope it helps someone. I will surely use that most of the time now instead of the 4.1.
Videos
I've been using Gemini 2.0 Flash and 2.0 Flash Thinking through third-party software AI providers like ChatLLM, which accesses these models via API. While these providers promise privacy, I'm still concerned about Google's role. If the underlying API is free, does that mean Google is extensively using our data for training and other purposes, regardless of what the third-party provider claims? What are your thoughts on this? Are there any ways to mitigate the risks, especially when using these APIs through intermediaries?
Gemini 1.5 APIs are not free. So, I would respect Google to respect user privacy for the 1.5 LLMs.
There are not paid versions of Gemini 2.0 Flash and 2.0 Flash Thinking at the moment. I can't pay even if I want to.
Hello, Iโve got no software developing background and finally figured out how to turn my localhost to https by webhook to connect telegram to my localhost.
Iโm building a bookkeeping flow which allows me to send my receipt through Telegram and it extract data into ky Google sheets.
Then eventually there isnโt any stable API that I can use for freeโฆ (I thought Gemini pro plan is the usage of API)
What model can I choose for a reliable api that I can use in n8n?
(To analysis my receipt in a different language)
Especially I personally prefer open Ai or Gemini.
Thanks in advance!
Just noticed that Google completely gutted free tier API limits - if you have any projects that were using flash models with big daily limits - please check those as most likely if you are not at leas tier 1 - you will be seeing 429's in your console just like i did. This came out of nowhere. Hope this gives a heads up to some people.
Now that google has slashed the free tier of the API, does anybody know of any alternatives we can use? I know there probably isn't a free alternative to 2.5 pro, since big models are obviously expensive to run. But is there another free provider that offers vision-capable models on the level of e.g. 2.5-flash/2.5-flash-lite? 20 rpd is just not enough for what I'm doing :(
Hi guys,
I'm using the Gemini integration in some of my automations, to analyze snapshots of my cameras in certain situations. I had some days with around ~150 calls.
Before, on the Free API tier, gemini had 1000 free requests per day on their flash-lite model. But today I reached the limit after 20 calls.
Do you guys also have this change?
Now that the Gemini API has removed its daily free token limit, I'm looking for any other API providers that still offer a free daily tier. I really enjoy working with Cline, but I currently can't afford to pay for API access.
Does anyone know of any alternatives with a free tier that works well with Cline?
Gemma 4 31B ranks 27th place in arena.ai, that puts it slightly below Gemini 3 Flash in terms of performance.
Gemini API gives you 1500 FREE daily requests for this model with unlimited tokens per minute. This is VERY generous. Highly recommend taking advantage of it while you can.
Previously, you could get a limited amount of free API access to Gemini 2.5 Pro via OpenRouter, but now you can't. So I am connecting to Gemini directly, and am confused about what I will get free, especially if I enable billing. This thread suggested that paid users get more free access to Gemini 2.5 Pro, but it seems like that was a limited time offer.
Looking at the rate limit page, it seems like free users get 100 free requests per day (same as OpenRouter used to be.) But what if I enable billing? Do I still get 100 free requests per day?
I'm trying to figure out any way to reduce my spending on Gemini as it is getting out of hand!
Hey folks,
Iโm looking for the best free way to access the Gemini API. I know Google offers a free tier via AI Studio, but are there any better options?
EDIT May 12: Google added new rate limits to AI studio this morning, 25 RPD, so this is effectively no longer working. We had a good run!
EDIT: The Open Router integration and google ai studio key method seems like the easiest way that works for everyone, especially if you already have an openrouter account. Pasting that method here for visibility:
-
Get a Google AI Studio API key from https://aistudio.google.com/apikey
-
Plug that API key into the Google AI Studio integration on https://openrouter.ai/settings/integrations, select enabled but not "Use as fallback"
-
You can now use your Openrouter key anywhere and as long as you select Gemini 2.5 Pro Experimental as your model, it will use your key. (4. If you also add a GCP key and have Vertex AI enabled for the project, obtained as per the instructions below, you can add that to your integrations as well, and you will have two sources of 2.5 Pro backing your Openrouter calls.)
Hey all,
After dealing with Openrouter and Requesty giving me constant rate limits for Gemini 2.5 Pro, I got frustrated and decided to get things set up directly through Google's APIs. I have now sent over 60 million tokens in a single day without hitting any rate limits, all for $0โan equivalent usage with Claude would have cost $180. I also had a few other engineers confirm these steps. Here's how to do it and then integrate with Roo Code--but this should work for other tools like Cline, too:
Setting Up Google Cloud
-
Create or log into your Google Cloud account.
-
Open the Google Cloud Console.
-
Create a new Google Cloud project (I named mine "Roo Code").
-
Enable billing for your newly created Google Cloud project.
-
Enable the Vertex AI API.
-
Enable the Gemini API from the API overview page.
-
In your project dashboard, navigate to APIs & Services โ Credentials.
-
Click "Create Credentials" โ "API Key".
-
Copy the generated API key and save it securely.
Integrating with Your IDE (Example: Roo Code)
-
In VSCode or Cursor, navigate to the extensions marketplace (Shift + Cmd + X on Mac), search for and install "Roo Code" (or your preferred tool like Cline).
-
Open Roo Code (Cmd + Shift + P, then type "View: Show Roo Code").
-
Click to configure a new API provider, selecting "Google Gemini".
-
Paste the API key you saved earlier into the API key field.
-
Select "google/gemini-2.5-pro-exp-03-25:free" as the model.
-
Click Save.
There you go! Happy coding. Let me know if you run into any issues.
Edit: looks like some are having issues. A few ideas/alternatives:
-
Use a Vertex api key, but gemini api as provider in Roo Code.There is only one key, ignore this alternative. -
Use vertex api as the provider in Roo Code--its just a little more complicated, you'll have to create a service account in the credentials page of the project, and paste the json in Roo Code when configuring the provider
-
If you have an OpenRouter account, you can go to the integrations page https://openrouter.ai/settings/integrations and add your vertex api key to the google vertex integration. You can also add a google ai studio api key to the Google AI Studio integration. In each setting window where you add the key, make sure it is enabled. Then, in Roo Code, you use your openrouter account, but whenever it uses Gemini 2.5 pro free, it will default to your API key, not one of theirs which is being rotated among many users.
Gemini is a new model developed by Google, and Bard is becoming usable again. With Gemini, it is now possible to get almost perfect answers to your queries by providing them with images, audio, and text.
In this tutorial, we will learn about the Gemini API and how to set it up on your machine. We will also explore various Python API functions, including text generation and image understanding.
https://www.kdnuggets.com/how-to-access-and-use-gemini-api-for-free
I'm considering enabling billing for the Gemini API because free tier won't be able to use pro models anymore.
My main questions: Does the paid tier still include any free usage allowance (eg. free for anything within the free tier rate limit, only pay for things above the free tier rate limit), or does everything become billable once you enable billing?
Gemini Free Tier is now down to 20 requests per day, which is essentially unusable for most of us. You might see the same thing in your logs (see below):
I can't see waht the paid tied gives us in terms of increasing requests, so if anyone has a link to that page, please add it in the comments.
Either way, what are our options? Can we do this locally? Is there another more cost effective option than a paid Google AI tier? Anyone tried the paid tier - how many requests does it give and how much is it?
If you do AI locally - please point me in the right direction on how to do this (currently using Rpi for HA)
UPDATE: So far Gemini paid tier is costing around 1p per day.
"error": { "code": 429, "message": "You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits. To monitor your current usage, head to: https://ai.dev/usage?tab=rate-limit. \n* Quota exceeded for metric: generativelanguage.googleapis.com/generate_content_free_tier_requests, limit: 20, model: gemini-2.5-flash..My credits arent being used up why?