Videos
I want to use Gemini 2.5 Pro text to speech for my monetized Youtube videos. The current Preview model in Google AI Studio is only for personal use and not for commercial use. I am willing to pay for it.
I am new to Google AI / Gemini.
Can you explain to me, like I am 5, how can I access the Gemini 2.5 Pro text to speech after it disappeared from the Google AI Studio?
I signed up for Google Cloud AI Vertex, and can see the TTS service using Chirp, which is not the Gemini 2.5 Pro TTS.
Will the same user interface, where you can enter style prompts and download the audio be available in Google Cloud?
There are so many information online, but I didn't find the answer. I hope there is a service for commercial use similar to the Google AI Studio interface.
Google has added speech generation capabilities to Google AI Studio and API. It supports both single-speaker and multi-speaker text-to-speech (gemini-2.5-pro-preview-tts, gemini-2.5-flash-preview-tts)
This means we can now create podcasts similar to what NotebookLM does. I tried it, and it's really great.
- I took a document, loaded it into Gemini 2.5 Pro, and asked it to generate a podcast with two speakers based on this document.
- Then, I took this script and loaded it into a text-to-speech model and received the perfect podcast.
I am greatly impressed.
I just had a long streaming conversation with Gemini 2.5 Flash Preview Native Audio Dialog in which he lied to me and wanted to download it and show it to support and upload it to reddit. Is there any way to do that? I have autosave on but this specific conversation stopped when I told Gemini I was going to report to support and it isnt getting saved in Google drive. Is there any way to have it other than OBS recording? Thanks