Are there any free/open-source TTS options out there that are on the same level as Google Cloud’s? I tried a lot of free ones, but they are absolutely awful and still sound like my Amiga did 30 years ago. With LLMs being available as open source, I am hoping there’s also a good TTS offering I just haven’t found yet.
Piper is my choice. It’s very easy to use from the command line and has fairly good-sounding voices. Before that, for years (decades?), I used espeak-ng. It had a very robotic voice but articulated almost everything very clearly, and I got used to it, so I didn’t actually mind.
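For anyone who wants to script it, here is a minimal sketch of driving the Piper command line from Python. It assumes a `piper` executable is on your PATH and that you have already downloaded a voice model; the `--model` and `--output_file` flags are the ones shown in Piper's README, while the function names and file paths are placeholders made up for illustration.

```python
import shutil
import subprocess


def build_piper_command(model_path, output_path):
    """Return the argument list for one Piper CLI call."""
    # The model and output paths are placeholders; point them at your own files.
    return ["piper", "--model", model_path, "--output_file", output_path]


def speak_to_file(text, model_path, output_path):
    """Pipe text to Piper on stdin and let it write a WAV file."""
    if shutil.which("piper") is None:
        raise RuntimeError("piper executable not found on PATH")
    subprocess.run(
        build_piper_command(model_path, output_path),
        input=text.encode("utf-8"),  # Piper reads the text to speak from stdin
        check=True,                  # raise if Piper exits with an error
    )
```

Calling `speak_to_file("Hello from Piper.", "en_US-lessac-medium.onnx", "hello.wav")` should then produce a WAV file, provided that model file exists locally.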
Wow.
Came here to recommend Piper. It’s an excellent TTS engine.
https://github.com/rsxdalv/tts-generation-webui and https://github.com/gitmylo/audio-webui. I use them all the time. From a 10-second sample, I get amazing results.
Cool, I’ll give those a try!
Have you tried Piper?
Yes, but if you compare it to https://cloud.google.com/text-to-speech?hl=en (scroll down a bit and you can try it) and the Neural2 model, it sounds like shit. I mean, it’s great to see that there are efforts, but it just pales in comparison.
Well, it’s about as good as you’re going to get right now.