-
Notifications
You must be signed in to change notification settings - Fork 227
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Whisper support #686
Comments
#687 adds this! |
#687 was merged, so the next release should have (core) Whisper support. Next step is FasterWhisper, which is indeed much faster. This is waiting on the upstream PR Uberi/speech_recognition#693 to implement in |
|
Oh, this happens after I say Kaliope (during STT?) |
Oh, sorry, I was in master; I noticed in your PR it's in dev. That's working. :)) |
OpenAI-Whisper is a really high quality locally-run speech-to-text project. (It can also translate; but nevermind that for now).
It would be awesome if we could get that in here, even for the trigger word recognition.
I started a separate project to use it for live transcription. Whisper runs as a single command, or from python. My project includes a little python flask app that keeps the model loaded, takes a wav file (POSTed), and returns just the text.
https://github.com/jaggzh/whisperpluck
Before I do any effort to implement it in Kalliope I wanted to run it by you all. (I'm quite busy so I don't think I'll even be able to get to it for now. I coded whisperpluck for a friend who has issues using the computer).
The text was updated successfully, but these errors were encountered: