I was surprised that my personal project hit its quota limit. Checking the limits, I found they have been massively reduced.
New Gemini API rate limits!
python - ResourceExhausted: 429 with "limit: 0" on Gemini API Free Tier (LangChain) even after new project - Stack Overflow
Gemini 2.5 pro API free tier has a 6m token limit
Does having Gemini Advanced give you higher limits in AI Studio or not? I don't mean the API with a key, just usage inside AI Studio. Also, does anyone know how much of a limit you get on 2.5 Pro in Gemini itself with Advanced?
I don't know if it's a bug or Google is being generous again to gain the devs they lost when they nerfed the API limits in December 2025.
They added Nano Banana Pro to the free tier with a 1.5k RPD rate limit. The problem is that when I tried testing the models on the free tier, it told me I had exceeded my quota, so I'm not sure whether it's a bug. I'm surprised no one is talking about it here; I guess y'all are busy with Gemini 3.1.
It's because you aren't connected to the API properly. After you quit Gemini CLI, it shows Total Quota Used. If it shows 0 for everything, you aren't connected.
This is a problem I faced when using it through a Docker container.
Here's the docker-compose.yml file to fix that (make sure to update Docker Desktop to the latest version):
services:
  # --- GEMINI 3 (LATEST) ---
  # BEST DAILY DRIVER: Smart, Fast, Free
  flash3:
    image: us-docker.pkg.dev/gemini-code-dev/gemini-cli/sandbox:0.1.1
    environment:
      - GEMINI_API_KEY=${GEMINI_API_KEY}
      - GEMINI_NO_INDEX=true
    ports:
      - "8100-8110:8000-8010" # Web Preview Port Range (Adjust as needed for tools) (Currently accessible via localhost:8100, 8101, etc.)
    volumes: &vols
      - D:\Projects\Gemini-CLI-Projects\All-Projects:/app/workspace
      - ./gemini_config:/home/node/.gemini
      - ./GEMINI.md:/home/node/GEMINI.md:ro
      - ./Global-Docs:/app/library:ro
    command: gemini -m gemini-3-flash-preview
    stdin_open: true
    tty: true
Also make a .env file in the same folder, edit it, and add this one line:
GEMINI_API_KEY=your_actual_api_key_here
Replace "your_actual_api_key_here" with your actual API key.
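If you want to double-check the .env file before starting the container, something like the following might help. It is only a rough sketch: the helper names read_env_value and key_looks_set are made up for illustration, the parsing is deliberately simple (it is not Docker's own .env parser), and it assumes the .env file sits next to docker-compose.yml.

```python
from pathlib import Path
from typing import Optional

# Placeholder text from the .env line above; a real key should replace it.
PLACEHOLDER = "your_actual_api_key_here"


def read_env_value(text: str, name: str) -> Optional[str]:
    """Return the value assigned to `name` in .env-style text, or None if absent."""
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == name:
            # Drop surrounding whitespace and simple quoting.
            return value.strip().strip('"').strip("'")
    return None


def key_looks_set(text: str, name: str = "GEMINI_API_KEY") -> bool:
    """True when the key exists, is non-empty, and is not the placeholder."""
    value = read_env_value(text, name)
    return bool(value) and value != PLACEHOLDER


if __name__ == "__main__":
    env_path = Path(".env")  # assumed location: same folder as docker-compose.yml
    text = env_path.read_text(encoding="utf-8") if env_path.exists() else ""
    print("GEMINI_API_KEY set:", key_looks_set(text))
```

This only confirms that a value is present and is not the placeholder; it cannot tell whether the key is valid or whether the project behind it has any quota.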
Now go to the folder where docker-compose.yml is, open cmd, paste this line, and hit Enter:
docker compose run --rm flash3
Make sure Docker Engine is running 🙂
The limit: 0 error appears because your Google Cloud project does not have Free Tier quotas activated.
Since March 2024, Google requires linking a billing account even for Gemini Free Tier.
If billing is not enabled, the free quotas for:
generate_content_free_tier_requests
input_token_count
output_token_count
are all 0, which causes the 429 ResourceExhausted error even on a new project.
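To tell this case apart from an ordinary rate limit in your own code, you could look at the limit value reported in the error text. The sketch below assumes the message contains a "limit: <number>" fragment, as in the error from the question. The function names and return labels are made up for illustration and are not part of any Google or LangChain library.

```python
import re
from typing import Optional

# Assumed message shape: the quota error text contains "limit: <number>".
LIMIT_PATTERN = re.compile(r"limit:\s*(\d+)")


def reported_limit(message: str) -> Optional[int]:
    """Extract the first 'limit: N' value from an error message, if any."""
    match = LIMIT_PATTERN.search(message)
    return int(match.group(1)) if match else None


def classify_429(message: str) -> str:
    """Label a 429 message by the limit it reports.

    'no_quota'     -> the quota itself is 0, so waiting and retrying will not help
    'rate_limited' -> a non-zero quota was exceeded, so retrying later may work
    'unknown'      -> no limit value was found in the message
    """
    limit = reported_limit(message)
    if limit is None:
        return "unknown"
    return "no_quota" if limit == 0 else "rate_limited"


if __name__ == "__main__":
    # Hypothetical message text, shortened for illustration.
    sample = "429 quota exceeded for generate_content_free_tier_requests, limit: 0"
    print(classify_429(sample))
```

In practice you would pass in str(exc) from the caught exception. A "no_quota" result means the project configuration needs checking rather than a retry loop.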
After you link a billing account, the free-tier quotas (50 requests/day, token limits, etc.) become available and the API will work.
The error refers to gemini-2.0-flash because the client library falls back to the default model internally when the quota lookup fails, even if you requested gemini-2.5-flash.
To fix:
Enable billing in your GCP project
Recheck your Gemini quotas
Run your LangChain code again
This will activate the actual free tier.