๐ŸŒ
Microsoft Learn
learn.microsoft.com โ€บ en-us โ€บ azure โ€บ ai-services โ€บ speech-service โ€บ text-to-speech
Text to speech overview - Speech service - Foundry Tools | Microsoft Learn
Text to speech enables your ... is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand....
Discussions

I swear, the Microsoft Text to Speech voices are hilarious
Hey! Be a cool member of this subreddit and respectful to everyone, avoid disruptive threads and report discriminatory comments. Also we have a Discord!!??!! https://discord.gg/Z4M5gcDeua I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns. More on reddit.com
๐ŸŒ r/GenAlpha
2
5
December 23, 2024
azure - Speech-to-text recognition of multiple voices with Microsoft Speech API? - Stack Overflow
I would like to know if Microsoft Speech API on Python supports multiple voices recognition. I saw the beta of SDK Speaker Recognition, but I was wondering if this feature was already in the Speech... More on stackoverflow.com
๐ŸŒ stackoverflow.com
Guide on how to use Microsoft Azure's Text-to-Speech engine.
Additional Note: This feature is actually built-in Microsoft Edge (on Windows) (but since this is an android sub I'll just put the guide on how to do in this comment). On Windows/Desktop: Just open your novel in Microsoft Edge (maybe via Royal Road, WuxiaWorld, Google Play Books, etc.), open the context menu (right click anywhere in the website) then click Read Aloud. You can choose from a variety of voices by clicking voice options on the top right/ (e.g. Microsoft Christopher Online (Natural) - English (US), which is my favourite voice among the ones available). More on reddit.com
๐ŸŒ r/Android
119
307
January 7, 2023
Does anyone know how to get old text to speech voices?
Here's what I found. It might be interesting. https://www.reddit.com/r/HalfLife/comments/9j427d/the_vox_investigation_continues_willowtalk_test/?utm_source=share&utm_medium=web2x&context=3 More on reddit.com
๐ŸŒ r/microsoft
2
5
February 27, 2023
๐ŸŒ
Tetyys
tetyys.com โ€บ SAPI4
Online Microsoft Sam TTS Generator
Microsoft Sam TTS Generator is an online interface for part of Microsoft Speech API 4.0 which was released in 1998. Select your voice. Note that BonziBUDDY voice is actually an "Adult Male #2" with a specific pitch and speed. Select your pitch and speed. All voices have lower and upper pitch and speed limits. Enter your text and press "Say it". Wait for generated audio appear in audio player. It should be done nearly instantly, as the interface tries to generate audio at x16777215 real-time.
๐ŸŒ
OpenAI
platform.openai.com โ€บ docs โ€บ guides โ€บ text-to-speech
Text to speech | OpenAI API
15 hours ago - If you're using the Realtime API, note that the set of available voices is slightly differentโ€”see the realtime conversations guide for current realtime voices. The Speech API provides support for realtime audio streaming using chunk transfer encoding. This means the audio can be played before the full file is generated and made accessible. Stream spoken audio from input text directly to your speakers
The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API (SAPI) or the Microsoft Speech Server Platform. There are client, server, and mobile versions โ€ฆ Wikipedia
๐ŸŒ
Wikipedia
en.wikipedia.org โ€บ wiki โ€บ Microsoft_text-to-speech_voices
Microsoft text-to-speech voices - Wikipedia
November 4, 2025 - The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API (SAPI) or the Microsoft Speech Server Platform. There are client, server, and mobile versions of Microsoft text-to-speech voices. Client voices are shipped with Windows ...
๐ŸŒ
Chrome Web Store
chromewebstore.google.com โ€บ detail โ€บ read-aloud-a-text-to-spee โ€บ hdhinadidafjejdhmfkjgnolgimiaplp
Read Aloud: A Text to Speech Voice Reader - Chrome Web Store
Read Aloud helps users who prefer ... you to select from a variety of text-to-speech voices, including native voices provided by the browser and AI voices from cloud providers such as Google Wavenet, Amazon Polly, IBM Watson, Microsoft Azure, ...
Find elsewhere
๐ŸŒ
Microsoft Support
support.microsoft.com โ€บ en-us โ€บ windows โ€บ appendix-a-supported-languages-and-voices-4486e345-7730-53da-fcfe-55cc64300f01
Appendix A: Supported languages and voices - Microsoft Support
Note: If you encounter into any issues during the Narrator Natural Voices setup process, see Appendix G: Troubleshooting Narrator Natural Voices setup issues. Next: Appendix B: Narrator keyboard commands and touch gestures ... The following table explains what languages and text-to-speech (TTS) voices are available in the latest version of Windows.
๐ŸŒ
Reddit
reddit.com โ€บ r/genalpha โ€บ i swear, the microsoft text to speech voices are hilarious
r/GenAlpha on Reddit: I swear, the Microsoft Text to Speech voices are hilarious
December 23, 2024 - Guide on how to use Microsoft Azure's Text-to-Speech engine. ... What are some of the words and phrases people use in conversation that absolutely wind you up to the point you might have to never speak to them again? ... Update: My boss cloned my voice using a text-to-speech software and used it to put words in my mouth.
๐ŸŒ
Microsoft Azure
azure.microsoft.com โ€บ en-us โ€บ products โ€บ ai-foundry โ€บ tools โ€บ speech
Azure Speech in Foundry Tools | Microsoft Azure
1 month ago - Explore Azure Speech in Foundry Tools(formerly AI Speech) for voice recognition and text to speech. Build multilingual AI apps with customized speech models.
๐ŸŒ
Genesys Cloud
help.mypurecloud.com โ€บ announcements โ€บ deprecation: genesys enhanced tts โ€“ microsoft azure and google voices
Deprecation: Genesys Enhanced TTS - Microsoft Azure and Google Voices - Genesys Cloud Resource Center
April 13, 2025 - You can either continue using the same TTS voices by migrating to Bring Your Own Technology (BYOT) model or switch to a Genesys Enhanced TTS voice offering from Amazon Polly. Additional information regarding BYOT-A will be shared soon. All Genesys Enhanced WaveNet and Neural TTS voices without the Polly prefix. The following Microsoft Azure Thai, Basque, and Welsh voices remain supported until a later date.
๐ŸŒ
Hugging Face
huggingface.co โ€บ microsoft โ€บ VibeVoice-1.5B
microsoft/VibeVoice-1.5B ยท Hugging Face
2 weeks ago - VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in ...
๐ŸŒ
Chrome Web Store
chromewebstore.google.com โ€บ detail โ€บ audiotts-simple-text-to-s โ€บ lhbdjaomnaobfljmhkmcfhhnihaaangh
AudioTTS - Simple Text to Speech Downloader - Chrome Web Store
2 weeks ago - Save audio from various text-to-speech tools for personal use, with support for platforms like ElevenLabs, Azure, Google, and more. After installing this extension, simply visit one of our supported sites listed below and begin generating voice from your text ๐ŸŒฑ: ๐Ÿค– Microsoft Azure https://azure.microsoft.com/en-us/products/ai-services/ai-speech ๐Ÿค– Google TTS https://www.gstatic.com/cloud-site-ux/text_to_speech/text_to_speech.min.html ๐Ÿค– ChatGPT from OpenAI ๐Ÿ‘‰ You need to click the speaker button download the voice https://chatgpt.com/c/your-conversation-id We have a lot of supported sites, see full list: ๐Ÿ‘‰ https://pastebin.com/6yi6qbnt See common FAQ: ๐Ÿ‘‰ https://pastebin.com/R8cpgwLV Supported browsers: ๐Ÿช All browsers that support Chrome extension installation (including Android, iOS, and Windows) are compatible with AudioTTS.
๐ŸŒ
GitHub
microsoft.github.io โ€บ VibeVoice
VibeVoice: A Frontier Open-Source Text-to-Speech Model
VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and natural turn-taking.
๐ŸŒ
Semantix
semantix.com โ€บ home โ€บ blog โ€บ text to speech software and apps: the complete guide โ€บ how to use text-to-speech for windows devices
How to use text-to-speech for Windows devices | Semantix
December 1, 2023 - Robo Talk is a free-to-use text-to-speech app obtainable from the Microsoft Store. Itโ€™s an easy converter that allows you to convert and save the file. It uses the traditional method to open a file from your library, and it then narrates the ...
๐ŸŒ
Microsoft Learn
learn.microsoft.com โ€บ en-us โ€บ azure โ€บ ai-services โ€บ speech-service โ€บ language-support
Language support - Speech service - Foundry Tools | Microsoft Learn
November 6, 2025 - The following tables summarize language support for speech to text, text to speech, pronunciation assessment, speech translation, and more service features. You can also get a list of locales and voices supported for each specific region or endpoint via:
๐ŸŒ
Microsoft Learn
learn.microsoft.com โ€บ en-us โ€บ azure โ€บ ai-services โ€บ speech-service โ€บ custom-neural-voice
Custom voice overview - Speech service - Foundry Tools | Microsoft Learn
August 7, 2025 - Custom voice is a text to speech feature that allows you to create a one-of-a-kind, customized, synthetic voice for your applications. You provide your own audio data as a sample.
๐ŸŒ
TTS Tool
ttstool.com
TTS Tool
It is not designed for synthesizing documents or large amounts of text. Please use the Amazon Polly or Google Wavenet tools for that purpose. ... pause: {{#row.pause.value}}s voice: {{#row.voice.name}} volume: {{#row.volume.desc}} rate: {{#row.rate.desc}} pitch: {{#row.pitch.desc}}
๐ŸŒ
Stack Overflow
stackoverflow.com โ€บ questions โ€บ 54370009 โ€บ speech-to-text-recognition-of-multiple-voices-with-microsoft-speech-api
azure - Speech-to-text recognition of multiple voices with Microsoft Speech API? - Stack Overflow
When I transcribed an audio of a two persons conversation (a female and a male) using Microsoft Speech-to-Text, the recognized text was not split when the voice changes.
๐ŸŒ
Reddit
reddit.com โ€บ r/android โ€บ guide on how to use microsoft azure's text-to-speech engine.
r/Android on Reddit: Guide on how to use Microsoft Azure's Text-to-Speech engine.
January 7, 2023 -

edit: I've outlined 5 different ways to do this, all with differing pros and cons

special thanks to this post by u/jiayounokim Method 01:

  • Link to download APK is here v0.5, link is in chinese: here's a screenshot of the english translation

  • After downloading and installing, select this option shown in the image here

  • This will open the Preferred engine settings, select the engine shown in the image here

  • Change the language by clicking this setting

  • Input this code block to change the language into en-US-ChristopherNeural:

<speak version="1.0" xml:lang="en-US"><voice name="en-US-ChristopherNeural"><prosody rate="${(rate-100)?c}%" pitch="${(pitch-100)?c}%"><mstts:express-as style="serious">${text}</mstts:express-as></prosody></voice></speak>

  • Use an epub reader with TTS feature (like Google Play Books), then open TTS feature and enjoy!

  • If you want to change it to a different language/voice, try out other voices here and then get the id of the name in here.

    • e.g. I tried and want to use the voice package, Monica (Neural) in English (United States)

    • I will look for the id of Monica (Neural) here

    • The language is "en-US", and the id is "en-US-MonicaNeural".

    • I will now substitute these two information to the code block below.

<speak version="1.0" xml:lang="language here"><voice name="id here"><prosody rate="${(rate-100)?c}%" pitch="${(pitch-100)?c}%"><mstts:express-as style="serious">${text}</mstts:express-as></prosody></voice></speak>

which will make it:

<speak version="1.0" xml:lang="en-US"><voice name="en-US-MonicaNeural"><prosody rate="${(rate-100)?c}%" pitch="${(pitch-100)?c}%"><mstts:express-as style="serious">${text}</mstts:express-as></prosody></voice></speak>

If the TTS is too slow for you, you can change it in Android Settings > Accessiblity > Text-To-Speech > Speech Rate. Or maybe your epub reader has a built-in speech rate and pitch setting (like Moon Reader+).

found a chinese thread which is the origin of the app


Why use Microsoft Azure's TTS?

  • It's much better and sounds more natural than the default TTS engine (Google TTS)

  • Has a variety of voices which you can choose from based on your preferences.

Why use TTS at all?

  • If you love reading books, this TTS engine is so good that it's practically turns all your books into a decent audiobooks. Official audiobook are still better but the voices sound natural enough that it gets the job done.


Edit:

I found out Android's Microsoft Edge also has this feature, it has less voices but it's still has a variety of good ones. Method 02: Using Android's Microsoft Edge

  • Just open your novel in Microsoft Edge (maybe via Royal Road, WuxiaWorld, Google Play Books, etc.),

  • open the options (three dots in bottom middle of the screen)

  • then click Read Aloud.

  • You can choose from a variety of voices by clicking voice options on the top right.

    • (e.g. Microsoft Christopher Online (Natural) - English (US), which is my favourite voice among the ones available).


Edit: I saw that the system-wide TTS isn't working for some of you, here's an alternative I found.

Method 03: System wide TTS engine using TTS Server app

  • Download the zip appropriate apk file or just the largest sized one if you don't know in this github link (note: you have to login to your github account for link to work changed the link, you don't have to log in now.)

  • Extract the zip file

  • Install the apk

  • The app is in broken English so it's a bit easier to navigate, although you might need to change it in the app settings.

  • Click the "+" on the top right

  • Change the language in to your desired choice (e.g. English US)

  • Choose your chosen voice

    • You can get a preview of the voice by putting words in the preview form and then hit play.

  • Exit out of the app and go to Android Settings

  • Search for Text to Speech or go to System > Language & Keyboard > Text-to-Speech Output (it will vary on your phone but this is the general idea).

  • Change the engine to TTS Server

  • Use an epub reader with TTS feature (like Google Play Books), then open TTS feature and enjoy!

  • Too slow? try this: Hamburger Menu (Top Right) > Settings > Turn on User DNS to resolve API IP


"This is a lot of work, is there an easier way?"

Well you can change the voice of the default TTS Engine on your phone (mostly Google TTS or Samsung TTS). I noticed that the default voice is bad, but some of the other voices is a slight upgrade.

Method 04: Adjusting Google Speech Services/Samsung TTS Engine

  • go to Android Settings

  • Search for Text to Speech or go to System > Language & Keyboard > Text-to-Speech Output (it will vary on your phone but this is the general idea).

  • Click on the setting icon near the TTS Engine

  • Find Install Voice Data

  • Find your language of choice (English US, English UK, etc.)

  • Download/Choose among the various voices in there

  • Go back and then you can preview the voice. You can also change the speed and pitch of the voice.

  • Use an epub reader with TTS feature (like Google Play Books), then open TTS feature and enjoy!

Bonus: If you're using Google Play Books for book reading and using Google TTS, you can go to Google Play books setting and turn on "High Quality Voice" (Idk how much of a difference this makes but it should be better).


Method 05: another TTS app

  • Download the TTS Apk in this github link

  • Install the TTS

  • In your Android Settings, find Text-To-Speech settings.

  • Change engine from the previously installed Text-To-Speech to this new one.

  • Use an epub reader with TTS feature (like Google Play Books), then open TTS feature and enjoy!

  • If you want to change the default voice (Jenny+) to another,

    • Open the installed TTS (it's in Chinese so it's a bit hard to navigate).

    • Next Click this to enable customization.

    • Double click the name of your choice to choose and hear an example of said voice (i.e. Christopher, Sonia, etc.)

    • There are other settings in there, here's a link to an album of screenshots and their corresponding translation.

(I'll update the links to the pictures later.)