speech to text huggingface space - Brave Search

huggingface.co › spaces

Spaces - Hugging Face

November 11, 2025 - Fine Tuning Tools · Dataset Creation · Pose Estimation · Face Recognition · Anomaly Detection · Recommendation Systems · Character Animation · Style Transfer · Image · Clear Search · Spaces of the week · 15 Dec 2025 · Filters (0) Sort: Relevance · Running on Zero · MCP · Featured · 181 · ⚡ · Chatterbox Turbo Demo · Running on Zero · Featured · 138 · 🐨 · Generate natural-sounding speech from text ·

huggingface.co › spaces › k2-fsa › automatic-speech-recognition

Automatic Speech Recognition - a Hugging Face Space by k2-fsa

This application transcribes audio files, microphone recordings, or audio URLs into text. Users can select the language, model, decoding method, and punctuation settings. The app outputs the transc...

Discussions

[News] Text to Speech is getting CRAZY GOOD - HierSpeech++, XTTS & StyleTTS2! (huggingface)

any of these implemented over on coqui? More on reddit.com

r/MachineLearning

2

1

December 3, 2023

🎧 Listen and Compare 12 Open-Source Text-to-Speech Models (Hugging Face Space)

Nice to have it all in one place. It'd be even nicer to have an apples to apples comparison, thus all female or all male voices, instead of mixed like it's now. Maybe both? The CSM example sounds like it's full of artifacts, just like F5-TTS - and both were highlighted for speech quality. Maybe something went wrong during generation? At least Sesame can sound way better. The Llasa sample seems slightly broken - that's maybe a hint that this happens more often? Same with the background noise for MegaTTS3. Orpheus was probably standing in a large room during the generation 😉. More on reddit.com

r/LocalLLaMA

30

155

July 6, 2025

Improved Text to Speech model: Parler TTS v1 by Hugging Face

Where can I find the full list of the 34 voice names, and do you have quick audio samples for them to get an idea of each one? More on reddit.com

r/LocalLLaMA

75

238

August 8, 2024

TTS Arena - a Hugging Face Space by TTS-AGI

No TortoiseTTS/Bark? Also, generating short utterances under 150 characters is essentially a solved problem. Long form text generation on other hand is where most current TTS models starting to show cracks. I suggest adding a long form option. More on reddit.com

r/LocalLLaMA

35

73

February 25, 2024

Videos

ModelScope Text To Video Synthesis - a Hugging Face Space by ali-vilab

3 steps to run HuggingFace 🤗 "Parler TTS" AI Voice on your local ...

October 13, 2024

Hugging Face - Text to Speech - Getting started in 5 minutes - YouTube

Hugging Face : Text to Audio | Text to Speech Generation - How ...

Let's Dive into a Speech Generation with AI Models Tutorial | ...

The Best Free Text to Speech AI You've Never Heard Of (Open Source) ...

huggingface.co › models

Text-to-Speech Models – Hugging Face

2 days ago - Text-to-Speech • 5B • Updated 2 days ago • 1.69k • 46 · Text-to-Speech • 3B • Updated Sep 1 • 517k • 2.09k · Text-to-Speech • Updated Apr 10 • 3.96M • • 5.4k · Text-to-Speech • Updated Dec 11, 2023 • 6.41M • 3.24k · Text-to-Speech • Updated 1 day ago • 18 • 24 ·

huggingface.co › spaces › hexgrad › Kokoro-TTS

Kokoro TTS - a Hugging Face Space by hexgrad

This app turns your text into natural-sounding speech. You input text and choose a voice, then it outputs audio. You can also adjust the speed and select from various voices.

huggingface.co › spaces › openai › whisper

Whisper - a Hugging Face Space by openai

Upload or record an audio file, or provide a YouTube video link to convert it into text. Choose between transcribing or translating the audio.

huggingface.co › spaces › mrfakename › E2-F5-TTS

F5-TTS - a Hugging Face Space by mrfakename

Upload an audio clip and corresponding text, then input new text to hear it spoken in the same voice. Optionally remove silence from the output.

huggingface.co › spaces › maobadi › speech-to-text

Speech To Text - a Hugging Face Space by maobadi

git clone https://huggingface.co/spaces/maobadi/speech-to-text · PowerShell uv uvx · # Make sure the hf CLI is installed powershell -ExecutionPolicy ByPass -c "irm https://hf.co/cli/install.ps1 | iex" # Download the Space hf download maobadi/speech-to-text --repo-type=space ·

huggingface.co › models

Automatic Speech Recognition Models – Hugging Face

Text-to-Speech · Text-to-Audio · Automatic Speech Recognition · Audio-to-Audio · Audio Classification · Voice Activity Detection · Tabular · Tabular Classification · Tabular Regression · Time Series Forecasting · Reinforcement Learning · Reinforcement Learning ·

Find elsewhere

Google Bing Mojeek

huggingface.co › spaces › Xenova › whisper-web

Whisper Web - a Hugging Face Space by Xenova

Whisper Web turns your spoken words into written text. Simply speak into your microphone, and the app will transcribe what you say into text for you.

huggingface.co › spaces › balacoon › tts

Text-to-Speech - a Hugging Face Space by balacoon

Enter text and select a model and speaker to generate speech. Listen to the synthesized audio result.

huggingface.co › tasks › text-to-speech

What is Text-to-Speech? - Hugging Face

TTS models can be extended to have a single model that generates speech for multiple speakers and multiple languages. ... I love audio models on the Hub! ... Text-to-Speech (TTS) models can be used in any speech-enabled application that requires converting text to speech imitating human voice.

huggingface.co › docs › transformers › tasks › text-to-speech

If you are looking to fine-tune a TTS model, the only text-to-speech models currently available in 🤗 Transformers are SpeechT5, FastSpeech2Conformer, Dia and CSM though more will be added in the future. SpeechT5 is pre-trained on a combination of speech-to-text and text-to-speech data, allowing it to learn a unified space of hidden representations shared by both text and speech.

huggingface.co › spaces › aicryptogroup › text-to-speech

Text To Speech - a Hugging Face Space by aicryptogroup

Your new space has been created, follow these steps to get started (or read the full documentation) ... # Make sure hf CLI is installed: pip install -U "huggingface_hub[cli]" hf download aicryptogroup/text-to-speech --repo-type=space

huggingface.co › spaces › vladocar › Text-to-Speech

Text To Speech - a Hugging Face Space by vladocar

Discover amazing ML apps made by the community

huggingface.co › spaces › suno › bark

Bark - a Hugging Face Space by suno

Convert any text into highly realistic, multilingual speech. Provide the text and choose a voice or accent to hear the generated audio.

huggingface.co › learn › audio-course › chapter6 › pre-trained_models

Pre-trained models for text-to-speech - Hugging Face Audio Course

In this section, we’ll explore how to use these pre-trained models in the Transformers library for TTS. SpeechT5 is a model published by Junyi Ao et al. from Microsoft that is capable of handling a range of speech tasks. While in this unit, we focus on the text-to-speech aspect, this model can be tailored to speech-to-text tasks (automatic speech recognition or speaker identification), as well as speech-to-speech (e.g.

huggingface.co › spaces › elevenlabs › tts

ElevenLabs TTS - a Hugging Face Space by elevenlabs

Enter text and select a voice to generate spoken audio. The app supports multiple languages and voices, and outputs an audio file.

huggingface.co › spaces › MattGPT › Text-2-Speech

Text-to-Speech - a Hugging Face Space by MattGPT

Discover amazing ML apps made by the community

huggingface.co › spaces › Matthijs › speecht5-tts-demo

SpeechT5 Speech Synthesis Demo - a Hugging Face Space by Matthijs

Discover amazing ML apps made by the community

huggingface.co › blog › speecht5

Speech Synthesis, Recognition, and More With SpeechT5

February 8, 2023 - If you want to jump right in, here are some demos on Spaces: ... SpeechT5 is not one, not two, but three kinds of speech models in one architecture. ... The main idea behind SpeechT5 is to pre-train a single model on a mixture of text-to-speech, speech-to-text, text-to-text, and speech-to-speech data.