Videos
Google has added speech generation capabilities to Google AI Studio and API. It supports both single-speaker and multi-speaker text-to-speech (gemini-2.5-pro-preview-tts, gemini-2.5-flash-preview-tts)
This means we can now create podcasts similar to what NotebookLM does. I tried it, and it's really great.
- I took a document, loaded it into Gemini 2.5 Pro, and asked it to generate a podcast with two speakers based on this document.
- Then, I took this script and loaded it into a text-to-speech model and received the perfect podcast.
I am greatly impressed.
I’ve been experimenting with different text-to-speech voices in Google AI Studio to see how they might work for short data analysis tutorial videos, and to do that I made this short video (less than three minutes).
Each voice reads one of four short scripts, so you can hear a range of tones and delivery styles. These are not all of the available voices in AI Studio — this sample represents about half of them.
You’ll also notice some “Recommended by Gemini” speech bubbles. Those highlight the voices that Gemini suggested as especially well-suited for educational or technical content. Personally I don't agree with some of those picks.
The video also has chapter markers, and you’ll find the links in the description so it’s easy to jump back and forth between voices.
Video link: https://youtu.be/dFE7TPF8Uu4?si=L4IagEoE5dD6Qu5O
Just sharing in case others are curious about how the voices sound side by side.
Hi, I recently started using Gemini 2.5 Pro which I think is the best AI model I have tried, I use it only for proofreading, sharing and refining Brainstorming and the like. But it bothers me a bit that I only have like 5 messages per day and then I have to wait almost 24 hours to be able to use it again.
I have searched for Google Studio Ai because I have seen that many people mention that it is as good or even better in some aspects, but it is not clear to me if it is free or not. I have tried to get information in my own language (I don't speak English) but the tutorials about it are only about how to use it for programming or take advantage of its more "Powerful" functions but they never clarify basic points such as if it is free or if it works for pure text.
I have searched online and in English publications and as far as I understand, using the Google Ai Studio page is free, only the API costs, is that correct? Did I understand it right or are there other things I should know?
Thanks in advance for your answers!
Hi. I am a FREE account. I am wandering is there a limit (daily/monthly) for generating text to voice with Google Ai Studio.
Can anyone share information or experience on it.