Videos
I learned about a new voice changer that might interest others that I think sounds good. It's w-okada's voice changer, you can find it here: https://github.com/w-okada/voice-changer/blob/master/README_en.md
Here are some voice clips of me using it (keep in mind the voices that come with it are not english voices and so have an accent and don't sound great in english):
Voice #1: https://on.soundcloud.com/a6SSp
Voice #4: https://on.soundcloud.com/xo3Fp
My un-converted voice: https://on.soundcloud.com/mbywD
I found a good video of someone showcasing the app and explaining how to use it: https://www.youtube.com/watch?v=pHhjg2JwdPI
There's a discord where people share voices they've made that are compatible with w-okada's voice changer (go to the voice-models channel): https://discord.gg/aihub
Finding a voice that was made in the language you plan to speak in will result in better quality and pronunciation. So use an english-speaking voice for english, for example
But be careful because the files people share in the ai hub discord above could contain malware when run in the voice changer
If you're having trouble finding the download, you want to find the part that looks like this, and select the one for your operating system. There's two windows versions, if the first one doesn't work for you, try the second one. Also, if it doesn't let you download because too many people are accessing it, try the "hugging_face" link directly below the "normal" link that I circled and download it from there instead
(the "normal" link was renamed to "google" but it's the same thing just called differently now)
After you download it and unzip it, find and run the start_http.bat file (or startHttp.command if you're on mac) It will start downloading and installing everything it needs to run and then it should start
If you can't even get the app to start properly, try downloading a previous version from the list of downloads
If you see a screen like this, then select RVC
Things I learned that you might want to know to use the app:
As with most other ai-powered voice changers, you'll need a computer with a decent gpu for it to run. I'm not an expert but probably something like a GTX 1060 would be the minimum you'd need
It's very important to adjust the tune slider in the app depending on the voice, this will change the pitch of your voice to the ai so it does a good job of matching the voice you choose. So generally for male to female increase the pitch slider, and for female to male lower the pitch slider until you find an amount that sounds good. For Voice #1 that I recorded above I used a tune of 6, and for Voice #4 I used a tune of 12
Increase the "chunk" and "extra" setting for better conversion quality, but at a cost of increased latency. I used a chunk of 512 and an extra of 32768 for the voice clips I recorded above
Some of the voices that you'll find in the discord I linked above will be worse quality than others because some of them were created with lower resources, so keep in mind that some voices people share could sound much much better or much much worse than others
Many buttons and settings in the app take time for it to process, so sometimes there will be a delay after changing a setting or switching a voice before it'll do it, don't worry that's normal
I changed the audio from client to server at the bottom of the app. I had problems on client mode
Hello i've been running W-Okada for awhile now and actually love it, nothing I've found beats it and everywhere i look everyone still says it's the best.
I just want to know what's the newest or "best" considered version right now? Currently i run v.1.5.3.18a.
I mess with a lotta AI stuff besides the voice changer so am used to keeping those programs up to date (or switching to a more powerful/consistent program) but don't really know how to keep track of W-Okada (which i understand is mostly do to the fact it's a program developed in Asia).
If anyone knows any updates or better versions please let me know or direct me as well as i hope any future people looking for the same info might also know. Thank you!
I'm trying to follow this tutorial: https://github.com/w-okada/voice-changer/blob/master/README_en.md
But I know absolutely nothing about code or anything like that. It says it supports Linux, but only has downloads for Mac and windows can someone help me download it for Linux? Maybe dumb it down for me please
i recently bought a voice model (which is a .ptn file btw) and wanted to use it, instructions said to use with okada's voice changer but turns out its a havers heavy program that is rec to need at least rtx 3080 just to use it decently, which is crazy, why does it need that much gpu too???
now ik there's rvc apps like voicemod and voice.ai, but they dont allow importing your own voice models. i want a low end version of okada voice changer. i've tried koemake, which was advertised to be free and is low latency (confirmed to work with gtx 1650 and above) so its great for me, only for me to find out that i need to pay in order usd6 monthly subscription to use it after downloading??? ngl I've never been so pissed off towards an app before
so yeah, i hope theres at least one rvc that works for low end users
so far the voice that has worked best for me (in terms of voice quality) is a gawr gura voice, but it's a little cartoony and I feel like if I introduced myself to people online with it they'd probably ask why I sound like a popular e-celebrity. and so far, it seems like all the quality models are of very famous people with very distinct voices (which makes sense).
it's difficult for me because my voice is naturally very low and I have a bad habit of speaking with a lot of vocal fry.
so what can I do to find quality but more generic voices? pick some real life celebrity that sounds close-ish enough to me but is also popular enough to have had a quality model made of her?
does anyone have some resources or a list of quality generic models that actually sound good?
Hey, I found this Github with a voice changer that I want to try out, but while it does have a .sh and both docker and annaconda capabilities, I'm unfamilliar with both and don't think that I can get it set up. I use Nobara as my daily driver, and have an amd GPU. Would anybody be willing to assist me.
https://github.com/w-okada/voice-changer/tree/master
I've tried seemingly all the big voice changers and they just aren't good. They sound robotic or they don't run on my computer because my GPU isn't that great.
Does anyone have any recommendations?
From what I see, most tools claiming to change your voice actually just convert your speech into text, and then that text back into an AI voice. You loose expression doing it this way, and it sounds a bit false.
It'd be super handy to retain the subtle inflections and performance of a talk, something mostly lost in "text to ai voice".
(and then the next question would be to run it locally!)
Would be good for YouTube channels.