k110111@feddit.de to

Open Source@lemmy.ml · 8 months ago

Any speech-to-text library that uses the Whisper API?

I have the API working: I can send audio files and get text back. What I am looking for is a robust way to add streaming functionality. For example, after a short period of silence, it should stop recording and send the audio to the API.

Is there a Python library that does this?
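
To make the behaviour described above concrete, here is a minimal sketch of silence-based segmentation using only the standard library. It assumes the audio arrives as fixed-size frames of 16-bit little-endian mono PCM; the function names (`rms`, `split_on_silence`) and the default values for `threshold` and `max_silent_frames` are hypothetical and would need tuning for a real microphone. It is not taken from any particular library.

```python
import math
import struct
from typing import Iterable, Iterator, List


def rms(frame: bytes) -> float:
    """Root-mean-square amplitude of a frame of 16-bit little-endian mono PCM."""
    count = len(frame) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack("<%dh" % count, frame[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def split_on_silence(
    frames: Iterable[bytes],
    threshold: float = 500.0,     # assumed RMS level separating speech from silence
    max_silent_frames: int = 10,  # assumed: this many quiet frames in a row end an utterance
) -> Iterator[bytes]:
    """Yield one bytes object per utterance, with surrounding silence dropped."""
    buffered: List[bytes] = []
    silent_run = 0
    for frame in frames:
        if rms(frame) >= threshold:
            # Loud frame: keep it and reset the silence counter.
            buffered.append(frame)
            silent_run = 0
        elif buffered:
            # Quiet frame inside an utterance: keep it for now, in case speech resumes.
            buffered.append(frame)
            silent_run += 1
            if silent_run >= max_silent_frames:
                # Enough silence: emit the utterance without its trailing quiet frames.
                yield b"".join(buffered[: len(buffered) - silent_run])
                buffered = []
                silent_run = 0
        # Quiet frames before any speech are simply skipped.
    if buffered:
        # The stream ended mid-utterance: emit what was collected.
        yield b"".join(buffered[: len(buffered) - silent_run])
```

Each chunk yielded by `split_on_silence` could then be written to an audio file and sent to the API in the same way as a complete recording. A fixed RMS threshold is a crude detector, so a dedicated voice-activity detector would likely be more robust in noisy conditions.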

Sims@lemmy.ml
8 months ago
Just stumbled upon this speedy one: https://github.com/sanchit-gandhi/whisper-jax

And this one for word-level timestamps: https://github.com/m-bain/whisperX
