Add support for OpenAI Whisper #687
I did notice that about the tests but I wasn't too sure how to handle that: anything you want me to do in this PR? |
Yes, you can try the proposed changes. |
Done, waiting on tests. I also accidentally added a commit for a second feature (FasterWhisper) which I force-pushed away, but that requires an upstream change to |
And it looks like that solved it. I suppose we can just keep that commit in here; I've updated the description. |
Thanks. You can rebase your other PR so I can merge it. |
Is this keeping the Whisper model loaded, or is it loaded on each call? Mine seems slow even with the tiny model. |
Oh, this is great, btw! Thanks. :)) |
@jaggzh I believe that SpeechRecognition is loading it each time, which is indeed very slow. I ended up abandoning this myself, and using a custom integration with my own https://github.com/joshuaboniface/remote-faster-whisper tool. |
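The slowdown discussed above (a model reloaded on every call) is usually avoided by keeping the loaded model in memory between calls. Below is a minimal sketch of that idea, not what this PR or SpeechRecognition actually does. `ModelCache` and `fake_loader` are hypothetical names introduced for illustration; with the openai-whisper package installed, the loader passed in could be `whisper.load_model`.

```python
from typing import Any, Callable, Dict


class ModelCache:
    """Keep loaded models in memory so each model name is loaded at most once."""

    def __init__(self, loader: Callable[[str], Any]) -> None:
        # loader is any callable that takes a model name and returns a model.
        self._loader = loader
        self._models: Dict[str, Any] = {}

    def get(self, name: str) -> Any:
        # Load only on the first request for this name, then reuse the object.
        if name not in self._models:
            self._models[name] = self._loader(name)
        return self._models[name]


# Hypothetical stand-in for an expensive load, used so the sketch runs
# without Whisper installed. It records every load request it receives.
load_calls = []


def fake_loader(name: str) -> dict:
    load_calls.append(name)
    return {"model": name}


cache = ModelCache(fake_loader)
first = cache.get("tiny")
second = cache.get("tiny")  # served from memory, no second load
```

The cache only helps if the process holding it stays alive between transcriptions, which is why a long-running server such as the ones linked in this thread is a common way to apply it.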
Your flask secret looks a lot more developed than my flask server :)
https://github.com/jaggzh/whisperpluck
(This project was for someone to use a GUI for transcription, though. It runs a server to keep a Whisper model loaded.)
|
Adds basic support for OpenAI Whisper (local) as an STT provider.
This is supported by the upstream SpeechRecognition library, so this is just the standard translation to/from Kalliope.
Several options have been implemented and documented, including the ability to "unformat" the resulting strings if desired.
Also caps the Ansible version below 5 to fix test failures.
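The description mentions an option to "unformat" the recognized strings but does not say what that involves. As a rough illustration only, and assuming it means lowercasing the text and stripping punctuation so it resembles the output of other STT engines, a sketch could look like this. The function name `unformat` and its exact behavior are assumptions, not the PR's code.

```python
import string


def unformat(text: str) -> str:
    """Lowercase text, drop ASCII punctuation, and collapse whitespace.

    Assumed behavior for illustration; the real option may differ.
    """
    # Remove every ASCII punctuation character (this also strips apostrophes).
    no_punct = text.translate(str.maketrans("", "", string.punctuation))
    # Lowercase, then split and rejoin to collapse runs of whitespace.
    return " ".join(no_punct.lower().split())


result = unformat("Hello, World!  Turn on the lights.")
```

Note that stripping all punctuation turns contractions such as "don't" into "dont", which may or may not be desirable when matching spoken orders.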