text-to-speech vietnamese huggingface

This is a Vietnamese Text-to-Speech (TTS) model trained to generate natural-sounding Vietnamese speech from text.

Hugging Face

huggingface.co › facebook › mms-tts-vie

facebook/mms-tts-vie · Hugging Face

This repository contains the Vietnamese (vie) language text-to-speech (TTS) model checkpoint.

Hugging Face

huggingface.co › dangvansam › viet-tts

dangvansam/viet-tts · Hugging Face

VietTTS is an open-source toolkit providing the community with a powerful Vietnamese TTS model, capable of natural voice synthesis and robust voice cloning. Designed for effective experimentation, VietTTS supports research and application in Vietnamese voice technologies. TTS: Text-to-Speech generation with any voice via prompt audio

Hugging Face

huggingface.co › hynt › F5-TTS-Vietnamese-100h

hynt/F5-TTS-Vietnamese-100h · Hugging Face

A compact fine-tuned version of F5-TTS trained on 150 hours of Vietnamese speech.

GitHub

github.com › thinhlpg › vixtts-demo

GitHub - thinhlpg/vixtts-demo: A Vietnamese Voice Cloning Text-to-Speech Model ✨

👉 Truy cập https://huggingface.co/spaces/thinhlpg/vixtts-demo để dùng ngay mà không cần cài đặt. viXTTS is a text-to-speech voice generation tool that offers voice cloning voices in Vietnamese and other languages.

Starred by 498 users

Forked by 206 users

Languages Jupyter Notebook 58.4% | Python 37.3% | Shell 4.3%

Hugging Face

huggingface.co › vinai › PhoWhisper-large

vinai/PhoWhisper-large · Hugging Face

We introduce PhoWhisper in five versions for Vietnamese automatic speech recognition. PhoWhisper's robustness is achieved through fine-tuning the multilingual Whisper on an 844-hour dataset that encompasses diverse Vietnamese accents. Our experimental study demonstrates state-of-the-art performances of PhoWhisper on benchmark Vietnamese ASR datasets. Please cite our PhoWhisper paper when it is used to help produce published results or is incorporated into other software:

Hugging Face

huggingface.co › nguyenvulebinh › wav2vec2-base-vietnamese-250h

nguyenvulebinh/wav2vec2-base-vietnamese-250h · Hugging Face

Our models are pre-trained on 13k hours of Vietnamese youtube audio (un-label data) and fine-tuned on 250 hours labeled of VLSP ASR dataset on 16kHz sampled speech audio.

Hugging Face

huggingface.co › hynt › F5-TTS-Vietnamese-ViVoice

hynt/F5-TTS-Vietnamese-ViVoice · Hugging Face

A compact fine-tuned version of F5-TTS trained on 1000 hours of Vietnamese speech.

Hugging Face

huggingface.co › datasets › ntt123 › viet-tts-dataset

ntt123/viet-tts-dataset · Datasets at Hugging Face

🔔🔔🔔 visit https://github.com/NTT123/vietTTS for a vietnamese TTS library (included pretrained models). 🔔🔔🔔 · The text is from a collection of novels and short stories from the author "Vu Trong Phung." The text is in public domain. The audio is generated by Google Text-to-Speech offline engine on Android.

Hugging Face

huggingface.co › thuongvv › VieNeu-TTS

thuongvv/VieNeu-TTS · Hugging Face

... A GGUF version is also planned for the earliest possible release. Current release: VieNeu-TTS-140h (stable & production-ready) VieNeu-TTS is an on-device Vietnamese Text-to-Speech (TTS) model with instant voice cloning.

Find elsewhere

Google Bing Mojeek

Hugging Face

huggingface.co › spaces › ntt123 › vietTTS

VietTTS - a Hugging Face Space by ntt123

This app turns Vietnamese text into spoken words. Enter your text, and it will generate an audio clip of the spoken version. Ideal for reading stories or documents aloud.

GitHub

github.com › NTT123 › vietTTS

GitHub - NTT123/vietTTS: Vietnamese Text to Speech library

Duration model + Acoustic model + HiFiGAN vocoder for vietnamese text-to-speech application. Online demo at https://huggingface.co/spaces/ntt123/vietTTS.

Starred by 247 users

Forked by 103 users

Languages Python 88.6% | Jupyter Notebook 10.3% | Shell 1.1%

Hugging Face

huggingface.co › pnnbao-ump › VieNeu-TTS

pnnbao-ump/VieNeu-TTS · Hugging Face

VieNeu-TTS is an advanced on-device Vietnamese Text-to-Speech (TTS) model with instant voice cloning.

Hugging Face

huggingface.co › collections › doof-ferb › vietnamese-speech-dataset-65c6af8c15c9950537862fa6

Vietnamese speech dataset - a doof-ferb Collection

Vietnamese speech dataset · updated Jul 8 · for any speech-related tasks including but not limited to: speech-to-text & text-to-speech, speech classification, speaker verification, etc. Upvote · 26 · +16 · Updated Jun 14, 2023 • 256 • 13 · Viewer • Updated Apr 20 • 56.4k • 1.1k • 11 ·

Hugging Face

huggingface.co › doannguyenmmo › VI-TEXT-TO-SPEECH

doannguyenmmo/VI-TEXT-TO-SPEECH · Hugging Face

A compact fine-tuned version of F5-TTS trained on 100 hours of Vietnamese speech.

Hugging Face

huggingface.co › spaces › nam194 › text-to-speech

Lightweight Vietnamese text to speech - a Hugging Face Space by nam194

This app converts Vietnamese text into speech. Users input text, adjust sentence spacing and reading speed, and get synthesized audio output.

Hugging Face

huggingface.co › ntdgo › ttsvi

ntdgo/ttsvi · Hugging Face

viⓍTTS is a voice generation model that lets you clone voices into different languages by using just a quick 6-second audio clip. This model is fine-tuned from the XTTS-v2.0.3 model by expanding the tokenizer to Vietnamese and fine-tuning ...

Hugging Face

huggingface.co › spaces › toandev › F5-TTS-Vietnamese

F5-TTS-Vietnamese - a Hugging Face Space by toandev

This app turns Vietnamese text into speech using a reference audio sample. Users provide an audio file and the text they want to convert, and get a synthesized audio file and a spectrogram image as...

Hugging Face

huggingface.co › spaces › ntt123 › Vietnam-female-voice-TTS

Vietnam Female Voice TTS - a Hugging Face Space by ntt123

This application converts written Vietnamese text into speech using a female voice. Users input text, and the app outputs an audio clip of the text being read aloud.

Hugging Face

huggingface.co › facebook › tts_transformer-vi-cv7

facebook/tts_transformer-vi-cv7 · Hugging Face

Text-to-Speech · Fairseq · common_voice · Vietnamese · audio · arxiv: 1809.08895 · arxiv: 2109.06912 · Model card Files Files and versions · xet Community · Use this model · Transformer text-to-speech model from fairseq S^2 (paper/code): Vietnamese ·