Hugging Face
huggingface.co › spaces
Spaces - Hugging Face
November 11, 2025 - Fine Tuning Tools · Dataset Creation · Pose Estimation · Face Recognition · Anomaly Detection · Recommendation Systems · Character Animation · Style Transfer · Image · Clear Search · Spaces of the week · 15 Dec 2025 · Filters (0) Sort: Relevance · Running on Zero · MCP · Featured · 181 · ⚡ · Chatterbox Turbo Demo · Running on Zero · Featured · 138 · 🐨 · Generate natural-sounding speech from text ·
Improved Text to Speech model: Parler TTS v1 by Hugging Face
Where can I find the full list of the 34 voice names, and do you have quick audio samples for them to get an idea of each one? More on reddit.com
[News] Text to Speech is getting CRAZY GOOD - HierSpeech++, XTTS & StyleTTS2! (huggingface)
any of these implemented over on coqui? More on reddit.com
🎧 Listen and Compare 12 Open-Source Text-to-Speech Models (Hugging Face Space)
Nice to have it all in one place. It'd be even nicer to have an apples to apples comparison, thus all female or all male voices, instead of mixed like it's now. Maybe both? The CSM example sounds like it's full of artifacts, just like F5-TTS - and both were highlighted for speech quality. Maybe something went wrong during generation? At least Sesame can sound way better. The Llasa sample seems slightly broken - that's maybe a hint that this happens more often? Same with the background noise for MegaTTS3. Orpheus was probably standing in a large room during the generation 😉. More on reddit.com
[P]Natural sounding text-to-speech, preferably free, with option to train voice on local machine with AMD GPU?
There were options up til you said AMD GPU. Basically nothing is written for training on AMD. Any solution to this problem takes out the "free" part. More on reddit.com
Videos
ModelScope Text To Video Synthesis - a Hugging Face Space by ali-vilab
18:42
3 steps to run HuggingFace 🤗 "Parler TTS" AI Voice on your local ...
01:57:29
Text-to-Speech Throwdown: Testing Hugging Face’s TTS Arena V2 ...
07:12
Hugging Face Spaces in 10 Minutes or Less - YouTube
Hugging Face
huggingface.co › docs › transformers › tasks › text-to-speech
Text to speech
If you are looking to fine-tune a TTS model, the only text-to-speech models currently available in 🤗 Transformers are SpeechT5, FastSpeech2Conformer, Dia and CSM though more will be added in the future. SpeechT5 is pre-trained on a combination of speech-to-text and text-to-speech data, allowing it to learn a unified space of hidden representations shared by both text and speech.
Hugging Face
huggingface.co › learn › audio-course › chapter6 › pre-trained_models
Pre-trained models for text-to-speech - Hugging Face Audio Course
In this section, we’ll explore how to use these pre-trained models in the Transformers library for TTS. SpeechT5 is a model published by Junyi Ao et al. from Microsoft that is capable of handling a range of speech tasks. While in this unit, we focus on the text-to-speech aspect, this model can be tailored to speech-to-text tasks (automatic speech recognition or speaker identification), as well as speech-to-speech (e.g.