Google Cloud
cloud.google.com › text-to-speech
Text-to-Speech AI: Lifelike Speech Synthesis | Google Cloud
October 1, 2025 - Turn text into natural-sounding speech in 220+ voices across 40+ languages and variants with an API powered by Google’s machine learning technology.
Google AI Studio
aistudio.google.com › generate-speech
Generate Speech
Discussions

Different Speech to Text models offered by Google
I’m a bit puzzled by the different Speech to Text models offered by Google. Can you help me understand the different offers? There is the old one on https://cloud.google.com/speech-to-text This is not an LLM but a purpose-built “old” type architecture, right? More on discuss.ai.google.dev
discuss.ai.google.dev
1
February 28, 2025
How to generate high-quality text-to-speech for free
Thank you so much! I used to read my books to my kids as bedtime stories, but they're adults now, so... More on reddit.com
๐ŸŒ r/AllThingsEditing
14
22
April 16, 2022
[D] Speech to Text - Google API or something else?
Mycroft has a good run down of options. https://mycroft-ai.gitbook.io/docs/using-mycroft-ai/customizations/stt-engine More on reddit.com
๐ŸŒ r/MachineLearning
7
1
February 14, 2021
They completely downgraded the text-to-speech demo, and I don't know what to do
Google is completely whiffing on everything ML. More on reddit.com
r/googlecloud
4
3
September 27, 2024
Google Cloud
cloud.google.com › speech-to-text
Speech-to-Text API: speech recognition and transcription | Google Cloud
Chirp 3: Transcription was built using self-supervised training on millions of hours of audio and 28 billion sentences of text spanning 100+ languages. ... Receive real-time speech recognition results as the API processes the audio input streamed from your application’s microphone or sent from a prerecorded audio file (inline or through Cloud Storage). Speech-to-Text uses model adaptation to improve the accuracy of frequently used words, expand the vocabulary available for transcription, and improve transcription from noisy audio.
NaturalReader
naturalreaders.com › online
Free Text to Speech Online with Realistic AI Voices
Convert text into ultra-realistic audio. Have any text read aloud with AI Voices. AI text reader for pdfs, books, documents, and webpages.
Google AI
ai.google.dev › gemini api › speech generation (text-to-speech)
Speech generation (text-to-speech) | Gemini API | Google AI for Developers
3 days ago - The Gemini API can transform text input into single speaker or multi-speaker audio using native text-to-speech (TTS) generation capabilities.
Google Play
play.google.com › store › apps › details
Speech Recognition & Synthesis - Apps on Google Play
Select Speech Recognition and Synthesis from Google as your preferred voice input engine. Google Text-to-Speech functionality Speech Services powers applications to read the text on your screen aloud.
Rating: 3.9 - 4.18M votes
Find elsewhere
Google
blog.google › technology › developers › gemini-2-5-text-to-speech
Improving Gemini Text-to-Speech models for better control and capabilities
2 days ago - Google is releasing upgraded Gemini 2.5 Flash and Pro Text-to-Speech models with better expressiveness, pacing, and multi-speaker capabilities. These models offer improved control over style, tone, and pronunciation for various use cases. Start testing the new TTS models in Google AI Studio and the Playground today.
Google
docs.cloud.google.com › ai and ml › vertex ai › generative ai on vertex ai › convert text to speech
Convert text to speech | Generative AI on Vertex AI | Google Cloud Documentation
In the Vertex AI section of the Google Cloud console, go to the Vertex AI Studio page. ... Click Generate speech. Select the Text-to-speech tab.
Google
docs.cloud.google.com › ai and ml › vertex ai › generative ai on vertex ai › google models
Google models | Generative AI on Vertex AI | Google Cloud Documentation
2 weeks ago - Veo 2 (Preview): Generates videos from text prompts and images, supporting inpaint and outpaint. Veo 2 (Experimental): An experimental model with features under test. Caution: MedLM is deprecated. Access to MedLM will no longer be available on or after September 29, 2025.
Dictation.io
dictation.io › speech
Voice Notepad - Speech to Text with Google Speech Recognition
Please open dictation.io inside Google Chrome to use speech recognition. ... Please follow this guide for instructions on how to unblock your microphone. ... Dictation is now publishing your note online. Please wait.. ... Speed is the rate at which the selected voice will speak your transcribed text while the pitch governs how high or low the voice speaks.
Langchain
docs.langchain.com › oss › python › integrations › document_loaders › google_speech_to_text
Google Speech-to-Text Audio Transcripts - Docs by LangChain
Refer to the Speech-to-Text recognizers documentation and the RecognizeRequest API reference for information on how to set a custom configuration. If you don’t specify a config, the following options will be selected automatically: ... from google.cloud.speech_v2 import ( AutoDetectDecodingConfig, RecognitionConfig, RecognitionFeatures, ) from langchain_google_community import SpeechToTextLoader project_id = "<PROJECT_ID>" location = "global" recognizer_id = "<RECOGNIZER_ID>" file_path = "./audio.wav" config = RecognitionConfig( auto_decoding_config=AutoDetectDecodingConfig(), language_codes=
Speechify
speechify.com › text-to-speech-online
Speechify: Free Text to Speech with Humanlike AI Voices
Convert any text to speech using your favorite voice. Listen to any text, including Google Docs, articles, emails, books, fanfiction, PDFs, websites and more in over 1,000 realistic AI voices in 60+ languages and accents.
Google Support
support.google.com › docs › answer › 4492226
Type & edit with your voice - Google Docs Editors Help
You can use your voice to type and edit your document in Google Docs and your speaker notes and captions in Google Slides. This feature works with the latest versions of: Chrome, Edge, Safari
Google AI
discuss.ai.google.dev › google ai studio
Different Speech to Text models offered by Google - Google AI Studio - Google AI Developers Forum
February 28, 2025 - I’m a bit puzzled by the different Speech to Text models offered by Google. Can you help me understand the different offers? There is the old one on https://cloud.google.com/speech-to-text This is not an LLM but a purp…
Uberduck
uberduck.ai
AI Vocals and Text To Speech | Uberduck
Generate speech, singing, and rapping from text. Write code for text to speech, text to singing, text to rapping, and voice conversion.
Amazon Web Services
aws.amazon.com › products › artificial intelligence › amazon polly
AI Voice Generator and Text-to-Speech Tool - Amazon Polly - AWS
1 day ago - Amazon Polly is a fully-managed service that generates voice on demand, converting any text to an audio stream. Using deep learning technologies to convert articles, web pages, PDF documents, and other text-to-speech (TTS). Polly provides dozens of lifelike voices across a broad set of languages for you to build speech-activated applications that engage and convert.
Reddit
reddit.com › r/allthingsediting › how to generate high-quality text-to-speech for free
r/AllThingsEditing on Reddit: How to generate high-quality text-to-speech for free
April 16, 2022

If you like to read your text out loud to catch awkward sentences, you may want to try text-to-speech. Unfortunately, the free alternatives sound horrible, and the available text-to-speech apps offering premium voices are expensive, especially if you're revising an entire novel. There is, however, a workaround. It's a little involved, but you only have to do it once.

Guide: How to generate text-to-speech using Google's Wavenet voices for free. (And legally.)

Wavenet is the artificial voice API used in Google Assistant, among others, and sounds considerably more natural than the free alternatives. If you register a Google Cloud account, you can activate the Cloud Text-to-Speech API and get 1 million characters a month for free directly from Google. Search for it in the API library, and it pops right up.

Be aware that if you exceed the allotted number of characters, you'll be charged $16 for another million. A million characters is enough for at least 150,000 words though, so you will most likely never come anywhere near running that risk.
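
The quota arithmetic above can be sketched as a small estimator. The two figures (1 million free characters a month, $16 per additional million) are the ones quoted in this post and may not match current pricing. The function name is hypothetical, and billing the overage pro rata per character is an assumption, not something the post states.

```python
FREE_CHARS_PER_MONTH = 1_000_000  # free monthly allowance quoted in the post
USD_PER_EXTRA_MILLION = 16.0      # overage price quoted in the post


def estimate_monthly_cost(chars_used: int) -> float:
    """Estimate one month's charge in USD for a given character count.

    Assumption: characters beyond the free allowance are billed pro rata.
    """
    if chars_used < 0:
        raise ValueError("chars_used must be non-negative")
    billable = max(0, chars_used - FREE_CHARS_PER_MONTH)
    return billable / 1_000_000 * USD_PER_EXTRA_MILLION
```

Under these assumptions, anything up to a million characters costs nothing, and a month with 1.5 million characters would come to about $8.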

The trick is now to take your newly acquired characters and generate an actual voice with them. You do that with an extension to Chrome called "Wavenet for Chrome", surprisingly. Install it and head back to Google Cloud to generate an API key. Instructions are provided by the extension, or can be found with a Google search. Generate the key and paste it into the extension. The configuration is now done.

You access the extension via the right-click menu, so you need to use a web text editor that doesn't override it. Google Docs and Word won't work. I use Wavemaker, but any simple editor will do.

Choose the voice you want in the extension and open your text in the editor. Select the part you want to generate, right-click and select "Download as MP3". This saves you from wasting characters by generating the same text over and over. Open your new file in the MP3 player of your choice, and there you go. Easy peasy lemon squeezy.
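
The post does not say what the extension sends when you pick "Download as MP3". As a rough sketch of an equivalent request, assuming the Cloud Text-to-Speech REST method `text:synthesize`, the helpers below build the JSON body and decode the base64 audio from a response. The helper names and the default voice `en-US-Wavenet-D` are illustrative choices, and the network call is left out so the snippet runs offline.

```python
import base64
import json

# REST endpoint of the Cloud Text-to-Speech API (v1).
SYNTHESIZE_URL = "https://texttospeech.googleapis.com/v1/text:synthesize"


def build_request_body(text: str,
                       voice_name: str = "en-US-Wavenet-D",
                       language_code: str = "en-US") -> str:
    """Return the JSON body for a text:synthesize request asking for MP3."""
    return json.dumps({
        "input": {"text": text},
        "voice": {"languageCode": language_code, "name": voice_name},
        "audioConfig": {"audioEncoding": "MP3"},
    })


def decode_audio(response_json: str) -> bytes:
    """Extract the raw audio bytes from a text:synthesize JSON response.

    The response carries the audio as base64 in the "audioContent" field.
    """
    return base64.b64decode(json.loads(response_json)["audioContent"])
```

To use it, you would POST the body to `SYNTHESIZE_URL` with your API key (for example as a `key` query parameter, assuming the key is allowed to call this API) and write the bytes returned by `decode_audio` to a `.mp3` file. Every character in `text` counts against the monthly allowance.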

AssemblyAI
assemblyai.com › blog › google-speech-to-text-api-python
How to use Google's Speech-to-Text API to transcribe audio in Python
1 month ago - The Google Cloud Speech-to-Text API converts audio files and real-time audio streams into text using Google's AI models. The API supports over 125 languages, which competitive analysis shows is the most extensive coverage among major providers.