Are there any free/open-source TTS options out there that are on the same level as Google Cloud’s? I tried a lot of free ones, but they are absolutely awful and still sound like my Amiga did 30 years ago. With LLMs being available as open source, I am hoping there’s also a good TTS offering I just haven’t found yet.
Piper is my choice. Very easy to use from the command line, fairly good sounding voices. Prior to that, for years (decades?) I used espeak-ng, had a very robotic voice but articulated almost everything very clearly, and I got used to it so didn’t actually mind.
Wow.
Came here to recommend Piper. It’s an excellent TTS engine.
Have you tried Piper?
Yes, but if you compare it to https://cloud.google.com/text-to-speech?hl=en (scroll down a bit and you can try it) and the Neural2 model, it sounds like shit. I mean, it’s great to see that there are efforts, but it just pales in comparison.
Well, it’s about as good as you’re going to get right now.
https://github.com/rsxdalv/tts-generation-webui and https://github.com/gitmylo/audio-webui. I use them all the time. Taking a sample of 10s i get amazing results.
Cool, I’ll give those a try!