Videos
Google has added speech generation capabilities to Google AI Studio and API. It supports both single-speaker and multi-speaker text-to-speech (gemini-2.5-pro-preview-tts, gemini-2.5-flash-preview-tts)
This means we can now create podcasts similar to what NotebookLM does. I tried it, and it's really great.
- I took a document, loaded it into Gemini 2.5 Pro, and asked it to generate a podcast with two speakers based on this document.
- Then, I took this script and loaded it into a text-to-speech model and received the perfect podcast.
I am greatly impressed.
I’ve been experimenting with different text-to-speech voices in Google AI Studio to see how they might work for short data analysis tutorial videos, and to do that I made this short video (less than three minutes).
Each voice reads one of four short scripts, so you can hear a range of tones and delivery styles. These are not all of the available voices in AI Studio — this sample represents about half of them.
You’ll also notice some “Recommended by Gemini” speech bubbles. Those highlight the voices that Gemini suggested as especially well-suited for educational or technical content. Personally I don't agree with some of those picks.
The video also has chapter markers, and you’ll find the links in the description so it’s easy to jump back and forth between voices.
Video link: https://youtu.be/dFE7TPF8Uu4?si=L4IagEoE5dD6Qu5O
Just sharing in case others are curious about how the voices sound side by side.
Hi. I am a FREE account. I am wandering is there a limit (daily/monthly) for generating text to voice with Google Ai Studio.
Can anyone share information or experience on it.
Hi, I recently started using Gemini 2.5 Pro which I think is the best AI model I have tried, I use it only for proofreading, sharing and refining Brainstorming and the like. But it bothers me a bit that I only have like 5 messages per day and then I have to wait almost 24 hours to be able to use it again.
I have searched for Google Studio Ai because I have seen that many people mention that it is as good or even better in some aspects, but it is not clear to me if it is free or not. I have tried to get information in my own language (I don't speak English) but the tutorials about it are only about how to use it for programming or take advantage of its more "Powerful" functions but they never clarify basic points such as if it is free or if it works for pure text.
I have searched online and in English publications and as far as I understand, using the Google Ai Studio page is free, only the API costs, is that correct? Did I understand it right or are there other things I should know?
Thanks in advance for your answers!
I want to use Gemini 2.5 Pro text to speech for my monetized Youtube videos. The current Preview model in Google AI Studio is only for personal use and not for commercial use. I am willing to pay for it.
I am new to Google AI / Gemini.
Can you explain to me, like I am 5, how can I access the Gemini 2.5 Pro text to speech after it disappeared from the Google AI Studio?
I signed up for Google Cloud AI Vertex, and can see the TTS service using Chirp, which is not the Gemini 2.5 Pro TTS.
Will the same user interface, where you can enter style prompts and download the audio be available in Google Cloud?
There are so many information online, but I didn't find the answer. I hope there is a service for commercial use similar to the Google AI Studio interface.