🌐
Google Cloud
cloud.google.com › text-to-speech
Text-to-Speech AI: Lifelike Speech Synthesis | Google Cloud
October 1, 2025 - Turn text into natural-sounding speech in 220+ voices across 40+ languages and variants with an API powered by Google’s machine learning technology.
🌐
Google AI Studio
aistudio.google.com › generate-speech
Generate Speech
Sign in · Use your Google Account · Not your computer? Use Guest mode to sign in privately. Learn more about using Guest mode · Create account
Discussions

Different Speech to Text models offered by Google
I’m a bit puzzled by the different Speech to Text models offered by Google. Can you help me understand the different offers? There is the old one on https://cloud.google.com/speech-to-text This is not an LLM but a purpose-built “old” type architecture, right? More on discuss.ai.google.dev
🌐 discuss.ai.google.dev
1
February 28, 2025
How to generate high-quality text-to-speech for free
Thank you so much! I used to read my books to my kids as bedtime stories, but they're adults now, so... More on reddit.com
🌐 r/AllThingsEditing
14
22
April 16, 2022
[D] Speech to Text - Google API or something else?
Mycroft has a good run down of options. https://mycroft-ai.gitbook.io/docs/using-mycroft-ai/customizations/stt-engine More on reddit.com
🌐 r/MachineLearning
7
1
February 11, 2021
They completely downgraded the text-to-speech demo, and I don't know what to do
Google is competly wiffing on everything ML. More on reddit.com
🌐 r/googlecloud
4
3
September 24, 2024
🌐
Google Cloud
cloud.google.com › speech-to-text
Speech-to-Text API: speech recognition and transcription | Google Cloud
Accurately convert voice to text in over 85+ languages and variants using Google AI API.
🌐
NaturalReader
naturalreaders.com › online
Free Text to Speech Online with Realistic AI Voices
Convert text into ultra-realistic audio. Have any text read aloud with AI Voices. AI text reader for pdfs, books, documents, and webpages.
🌐
Google
docs.cloud.google.com › ai and ml › vertex ai › generative ai on vertex ai › convert text to speech
Convert text to speech | Generative AI on Vertex AI | Google Cloud Documentation
In the Vertex AI section of the Google Cloud console, go to the Vertex AI Studio page. ... Click Generate speech. Select the Text-to-speech tab.
🌐
Softonic
google-texttospeech.en.softonic.com › home › android › utilities & tools › google text-to-speech
Google Text-to-Speech for Android - Download
3 days ago - Google Text-to-Speech for Android, free and safe download. Google Text-to-Speech latest version: Google Text-to-Speech. Google Text-to-Speech is an ap
Rating: 9.2/10 ​ - ​ 6 votes
🌐
Google Play
play.google.com › store › apps › details
Speech Recognition & Synthesis - Apps on Google Play
Power your device with the magic ... Speech-to-Text functionality Speech Recognition provides speech-to-text functionality to Google and other third party apps to convert what you say to text....
Rating: 3.9 ​ - ​ 4.18M votes
Find elsewhere
🌐
Google AI
ai.google.dev › gemini api › speech generation (text-to-speech)
Speech generation (text-to-speech) | Gemini API | Google AI for Developers
1 week ago - The Gemini API can transform text input into single speaker or multi-speaker audio using native text-to-speech (TTS) generation capabilities.
🌐
Google Workspace
workspace.google.com › marketplace › app › ai_transcription_realtime_speech_to_text › 501744932484
AI Transcription & Real-time Speech to Text - Google Workspace Marketplace
AI Transcription transcribes speech to text in real time, or transcribes audio or video to text. Powered by OpenAI's Whisper model. Features: ➤ Supports real-time transcription, instantly converting speech to text.
🌐
Google
docs.cloud.google.com › ai and ml › vertex ai › generative ai on vertex ai › google models
Google models | Generative AI on Vertex AI | Google Cloud Documentation
2 weeks ago - Features adaptive thinking, a 1M token context window, and integrated grounding for sophisticated multimodal problem solving. preview Gemini 3 Pro Image High-fidelity image generation with reasoning-enhanced composition. Supports legible text rendering, complex multi-turn editing, and character consistency using up to 14 reference inputs.
🌐
Dictation.io
dictation.io › speech
Voice Notepad - Speech to Text with Google Speech Recognition
Please open dictation.io inside Google Chrome to use speech recognition. ... Please follow this guide for instructions on how to unblock your microphone. ... Dictation is now publishing your note online. Please wait.. ... Speed is the rate at which the selected voice will speak your transcribed text while the pitch governs how high or low the voice speaks.
🌐
Google
blog.google › technology › developers › gemini-2-5-text-to-speech
Improving Gemini Text-to-Speech models for better control and capabilities
5 days ago - Google is releasing upgraded Gemini 2.5 Flash and Pro Text-to-Speech models with better expressiveness, pacing, and multi-speaker capabilities. These models offer improved control over style, tone, and pronunciation for various use cases. Start testing the new TTS models in Google AI Studio and the Playground today.
🌐
Langchain
docs.langchain.com › oss › python › integrations › document_loaders › google_speech_to_text
Google Speech-to-Text Audio Transcripts - Docs by LangChain
The SpeechToTextLoader allows to transcribe audio files with the Google Cloud Speech-to-Text API and loads the transcribed text into documents.
🌐
Speechify
speechify.com › text-to-speech-online
Speechify: Free Text to Speech with Humanlike AI Voices
Convert any text to speech using your favorite voice. Listen to any text, including Google Docs, articles, emails, books, fanfiction, PDFs, websites and more in over 1,000 realistic AI voices in 60+ languages and accents.
🌐
Google Support
support.google.com › docs › answer › 4492226
Type & edit with your voice - Google Docs Editors Help
You can use your voice to type and edit your document in Google Docs and your speaker notes and captions in Google Slides. This feature works with the latest versions of: Chrome Edge Safari
🌐
Google AI
discuss.ai.google.dev › google ai studio
Different Speech to Text models offered by Google - Google AI Studio - Google AI Developers Forum
February 28, 2025 - I’m a bit puzzled by the different Speech to Text models offered by Google. Can you help me understand the different offers? There is the old one on https://cloud.google.com/speech-to-text This is not an LLM but a purp…
🌐
Uberduck
uberduck.ai
AI Vocals and Text To Speech | Uberduck
Make Music, Voiceovers and Videos With AI Vocals, Text to Speech, Voice Conversion and Voice Cloning
🌐
Amazon Web Services
aws.amazon.com › products › artificial intelligence › amazon polly
AI Voice Generator and Text-to-Speech Tool - Amazon Polly - AWS
4 days ago - Amazon Polly is a fully-managed service that generates voice on demand, converting any text to an audio stream. Using deep learning technologies to convert articles, web pages, PDF documents, and other text-to-speech (TTS). Polly provides dozens of lifelike voices across a broad set of languages for you to build speech-activated applications that engage and convert.
🌐
Google Workspace
workspace.google.com › marketplace › app › text_to_speech_ai_voice_generator › 743993646804
Text to Speech - AI Voice Generator - Google Workspace Marketplace
Features: ➤ TTS modeling with AI to generate realistic speech. ➤ Supports 60+ languages. ➤ Supports Docs™, Slides™, Sheets™, PDFs, DOCX, XLSX, PPT. ➤ Privacy Policy By design, your data stays at all times on your account, never saved in our database. Your data aren’t shared with anyone, including the add-on owner. We complies with privacy laws (especially GDPR & California Privacy Act) to protect your data. ... Text to Speech - AI Voice Generator will ask for the permissions shown below.
🌐
Reddit
reddit.com › r/allthingsediting › how to generate high-quality text-to-speech for free
r/AllThingsEditing on Reddit: How to generate high-quality text-to-speech for free
April 16, 2022 -

If you like to read your text out loud to catch awkward sentences, you may want to try text-to-speech. Unfortunately the free alternatives sound horrible, and the available text-to-speech apps offering premium voices are expensive, especially if you're revising an entire novel. There is however a workaround, it's a little involved, but you only have to do it once.

Guide: How to generate text-to-speech using Google's Wavenet voices for free. (And legally.)

Wavenet is the artificial voice API used in Google assistant, among others, and sounds considerably more natural than the free alternatives. If you register a Google cloud account, you can activate the the Cloud text-to-speech API and get 1 million characters a month for free directly from Google. Search for it in the API library, and it pops right up.

Be aware that if you exceed the allotted amount of characters, you'll be charged $16 for another million. A million characters is enough for at least 150 000 words though, so you will most likely never come even near running that risk.

The trick is now to take your newly acquired characters and generate an actual voice with them. You do that with an extension to Chrome called "Wavenet for Chrome", surprisingly. Install it and head back to Google cloud to generate an API key. Instructions are provided by the extension, or can be found with a google search. Generate the key and paste it into the extension. The configuration is now done.

You access the extension via the right-click menu, so you need to use a web text editor that doesn't override it. Google docs and Word won't work. I use Wavemaker, but any simple editor will do.

Choose the voice you want in the extension and open your text in the editor. Select the part you want to generate, right-click and select "Download as MP3". This saves you from wasting characters by generating the same text over and over. Open your new file in the MP3-player of your choice and there you go. Easy peasy lemon squeezy.