Hugging Face
huggingface.co › zalopay › vietnamese-tts
zalopay/vietnamese-tts · Hugging Face
This is a Vietnamese Text-to-Speech (TTS) model trained to generate natural-sounding Vietnamese speech from text.
Hugging Face
huggingface.co › facebook › mms-tts-vie
facebook/mms-tts-vie · Hugging Face
This repository contains the Vietnamese (vie) language text-to-speech (TTS) model checkpoint.
Hugging Face
huggingface.co › dangvansam › viet-tts
dangvansam/viet-tts · Hugging Face
VietTTS is an open-source toolkit providing the community with a powerful Vietnamese TTS model, capable of natural voice synthesis and robust voice cloning. Designed for effective experimentation, VietTTS supports research and application in Vietnamese voice technologies. TTS: Text-to-Speech generation with any voice via prompt audio
Hugging Face
huggingface.co › hynt › F5-TTS-Vietnamese-100h
hynt/F5-TTS-Vietnamese-100h · Hugging Face
A compact fine-tuned version of F5-TTS trained on 150 hours of Vietnamese speech.
GitHub
github.com › thinhlpg › vixtts-demo
GitHub - thinhlpg/vixtts-demo: A Vietnamese Voice Cloning Text-to-Speech Model ✨
👉 Visit https://huggingface.co/spaces/thinhlpg/vixtts-demo to use it right away, with no installation needed. viXTTS is a text-to-speech voice generation tool that offers voice cloning in Vietnamese and other languages.
Starred by 498 users
Forked by 206 users
Languages   Jupyter Notebook 58.4% | Python 37.3% | Shell 4.3%
Hugging Face
huggingface.co › vinai › PhoWhisper-large
vinai/PhoWhisper-large · Hugging Face
We introduce PhoWhisper in five versions for Vietnamese automatic speech recognition. PhoWhisper's robustness is achieved through fine-tuning the multilingual Whisper on an 844-hour dataset that encompasses diverse Vietnamese accents. Our experimental study demonstrates state-of-the-art performance of PhoWhisper on benchmark Vietnamese ASR datasets. Please cite our PhoWhisper paper when it is used to help produce published results or is incorporated into other software:
Hugging Face
huggingface.co › nguyenvulebinh › wav2vec2-base-vietnamese-250h
nguyenvulebinh/wav2vec2-base-vietnamese-250h · Hugging Face
Our models are pre-trained on 13k hours of unlabeled Vietnamese YouTube audio and fine-tuned on 250 hours of labeled data from the VLSP ASR dataset, using speech audio sampled at 16 kHz.
Hugging Face
huggingface.co › hynt › F5-TTS-Vietnamese-ViVoice
hynt/F5-TTS-Vietnamese-ViVoice · Hugging Face
A compact fine-tuned version of F5-TTS trained on 1000 hours of Vietnamese speech.
Hugging Face
huggingface.co › datasets › ntt123 › viet-tts-dataset
ntt123/viet-tts-dataset · Datasets at Hugging Face
🔔🔔🔔 Visit https://github.com/NTT123/vietTTS for a Vietnamese TTS library (pretrained models included). 🔔🔔🔔 · The text is from a collection of novels and short stories by the author Vu Trong Phung. The text is in the public domain. The audio is generated by the Google Text-to-Speech offline engine on Android.
Hugging Face
huggingface.co › thuongvv › VieNeu-TTS
thuongvv/VieNeu-TTS · Hugging Face
... A GGUF version is also planned for the earliest possible release. Current release: VieNeu-TTS-140h (stable & production-ready) VieNeu-TTS is an on-device Vietnamese Text-to-Speech (TTS) model with instant voice cloning.
Hugging Face
huggingface.co › spaces › ntt123 › vietTTS
VietTTS - a Hugging Face Space by ntt123
This app turns Vietnamese text into spoken words. Enter your text, and it will generate an audio clip of the spoken version. Ideal for reading stories or documents aloud.
GitHub
github.com › NTT123 › vietTTS
GitHub - NTT123/vietTTS: Vietnamese Text to Speech library
Duration model + acoustic model + HiFiGAN vocoder for Vietnamese text-to-speech applications. Online demo at https://huggingface.co/spaces/ntt123/vietTTS.
Starred by 247 users
Forked by 103 users
Languages   Python 88.6% | Jupyter Notebook 10.3% | Shell 1.1%
Hugging Face
huggingface.co › pnnbao-ump › VieNeu-TTS
pnnbao-ump/VieNeu-TTS · Hugging Face
VieNeu-TTS is an advanced on-device Vietnamese Text-to-Speech (TTS) model with instant voice cloning.
Hugging Face
huggingface.co › collections › doof-ferb › vietnamese-speech-dataset-65c6af8c15c9950537862fa6
Vietnamese speech dataset - a doof-ferb Collection
Vietnamese speech dataset · updated Jul 8 · for any speech-related tasks including but not limited to: speech-to-text and text-to-speech, speech classification, speaker verification, etc.
Hugging Face
huggingface.co › doannguyenmmo › VI-TEXT-TO-SPEECH
doannguyenmmo/VI-TEXT-TO-SPEECH · Hugging Face
A compact fine-tuned version of F5-TTS trained on 100 hours of Vietnamese speech.
Hugging Face
huggingface.co › spaces › nam194 › text-to-speech
Lightweight Vietnamese text to speech - a Hugging Face Space by nam194
This app converts Vietnamese text into speech. Users input text, adjust sentence spacing and reading speed, and get synthesized audio output.
Hugging Face
huggingface.co › ntdgo › ttsvi
ntdgo/ttsvi · Hugging Face
viⓍTTS is a voice generation model that lets you clone voices into different languages by using just a quick 6-second audio clip. This model is fine-tuned from the XTTS-v2.0.3 model by expanding the tokenizer to Vietnamese and fine-tuning ...
Hugging Face
huggingface.co › spaces › toandev › F5-TTS-Vietnamese
F5-TTS-Vietnamese - a Hugging Face Space by toandev
This app turns Vietnamese text into speech using a reference audio sample. Users provide an audio file and the text they want to convert, and get a synthesized audio file and a spectrogram image as...
Hugging Face
huggingface.co › spaces › ntt123 › Vietnam-female-voice-TTS
Vietnam Female Voice TTS - a Hugging Face Space by ntt123
This application converts written Vietnamese text into speech using a female voice. Users input text, and the app outputs an audio clip of the text being read aloud.
Hugging Face
huggingface.co › facebook › tts_transformer-vi-cv7
facebook/tts_transformer-vi-cv7 · Hugging Face
Text-to-Speech · Fairseq · common_voice · Vietnamese · audio · arxiv: 1809.08895 · arxiv: 2109.06912 · Transformer text-to-speech model from fairseq S^2 (paper/code): Vietnamese