Google Adds Multi-Speaker TTS to AI Studio & API (Gemini 2.5 Pro/Flash) - Great for Podcasts!
Gemini 2.5 Pro text to speech
I just tried Google Gemini 3 voice mode and it is far better than ChatGPT. OpenAI, you have to raise your game.
Gemini Speech Generation
Google has added speech generation to Google AI Studio and the Gemini API. It supports both single-speaker and multi-speaker text-to-speech through two preview models: gemini-2.5-pro-preview-tts and gemini-2.5-flash-preview-tts.
This means we can now create podcasts similar to what NotebookLM does. I tried it, and it's really great.
- I took a document, loaded it into Gemini 2.5 Pro, and asked it to generate a podcast with two speakers based on this document.
- Then I loaded that script into the text-to-speech model and got a perfect podcast back.
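For anyone who wants to script the second step instead of clicking through AI Studio, here is a rough sketch using only the Python standard library against the Gemini REST endpoint. Treat the details as assumptions rather than a confirmed recipe: the helper names (`build_tts_payload`, `pcm_to_wav`, `synthesize`) are made up for this example, the voice names "Kore" and "Puck" are just examples of prebuilt voices, the API key is assumed to be in a `GEMINI_API_KEY` environment variable, and the returned audio is assumed to be raw 16-bit mono PCM at 24 kHz. Check the current API docs before relying on any of it.

```python
import base64
import io
import json
import os
import urllib.request
import wave

# Assumed REST endpoint for the Gemini API (v1beta).
API_URL = "https://generativelanguage.googleapis.com/v1beta/models/{model}:generateContent"


def build_tts_payload(script: str, voices: dict[str, str]) -> dict:
    """Build a multi-speaker TTS request body.

    `voices` maps each speaker name used in the script to a prebuilt voice name.
    """
    speaker_configs = [
        {
            "speaker": speaker,
            "voiceConfig": {"prebuiltVoiceConfig": {"voiceName": voice}},
        }
        for speaker, voice in voices.items()
    ]
    return {
        "contents": [{"parts": [{"text": script}]}],
        "generationConfig": {
            "responseModalities": ["AUDIO"],
            "speechConfig": {
                "multiSpeakerVoiceConfig": {"speakerVoiceConfigs": speaker_configs}
            },
        },
    }


def pcm_to_wav(pcm: bytes, rate: int = 24000) -> bytes:
    """Wrap raw 16-bit mono PCM in a WAV container (assumed output format)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(rate)
        wav.writeframes(pcm)
    return buf.getvalue()


def synthesize(script: str, voices: dict[str, str],
               model: str = "gemini-2.5-flash-preview-tts") -> bytes:
    """Send the script to the TTS model and return WAV bytes.

    Needs network access and an API key in GEMINI_API_KEY.
    """
    request = urllib.request.Request(
        API_URL.format(model=model),
        data=json.dumps(build_tts_payload(script, voices)).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "x-goog-api-key": os.environ["GEMINI_API_KEY"],
        },
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # The audio is assumed to arrive base64-encoded in the first response part.
    audio_b64 = body["candidates"][0]["content"]["parts"][0]["inlineData"]["data"]
    return pcm_to_wav(base64.b64decode(audio_b64))
```

To use it, write the script with one line per turn, each prefixed by the speaker name (for example "Host: ..." and "Guest: ..."), call `synthesize(script, {"Host": "Kore", "Guest": "Puck"})`, and write the returned bytes to a `.wav` file. The speaker names in the `voices` mapping have to match the names used in the script.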
I am greatly impressed.
I want to use Gemini 2.5 Pro text to speech for my monetized YouTube videos. The current Preview model in Google AI Studio is only for personal use, not commercial use. I am willing to pay for it.
I am new to Google AI / Gemini.
Can you explain to me, like I am 5, how I can access Gemini 2.5 Pro text to speech now that it has disappeared from Google AI Studio?
I signed up for Vertex AI on Google Cloud and can see the TTS service using Chirp, but that is not Gemini 2.5 Pro TTS.
Will the same user interface, where you can enter style prompts and download the audio, be available in Google Cloud?
There is so much information online, but I couldn't find the answer. I hope there is a service for commercial use similar to the Google AI Studio interface.