Videos
I want to use Gemini 2.5 Pro text to speech for my monetized Youtube videos. The current Preview model in Google AI Studio is only for personal use and not for commercial use. I am willing to pay for it.
I am new to Google AI / Gemini.
Can you explain to me, like I am 5, how can I access the Gemini 2.5 Pro text to speech after it disappeared from the Google AI Studio?
I signed up for Google Cloud AI Vertex, and can see the TTS service using Chirp, which is not the Gemini 2.5 Pro TTS.
Will the same user interface, where you can enter style prompts and download the audio be available in Google Cloud?
There are so many information online, but I didn't find the answer. I hope there is a service for commercial use similar to the Google AI Studio interface.
I found out the TTS on the answers that Gemini gives on the mobile app are quite good, which can br used as a free TTS reader. I assume it is generated with some of their AI voice models, unlike most free generic TTS services.
It's kind of a loop hole in the system.
But the in the Web Version, it just reads it with the normal Google Translate TTS voice, which is very weird..
The TTS voice also changes gender depending on which account I am on 😂.
Idk if anyone had this experience before, but I would appreciate any help.
I also tried the chatgpt web version, but it only read it as an english reader, no matter the output text language.
Google has added speech generation capabilities to Google AI Studio and API. It supports both single-speaker and multi-speaker text-to-speech (gemini-2.5-pro-preview-tts, gemini-2.5-flash-preview-tts)
This means we can now create podcasts similar to what NotebookLM does. I tried it, and it's really great.
- I took a document, loaded it into Gemini 2.5 Pro, and asked it to generate a podcast with two speakers based on this document.
- Then, I took this script and loaded it into a text-to-speech model and received the perfect podcast.
I am greatly impressed.