mywhisper-gtk: minimal Whisper voice input for Linux apps #2740
phplego
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all,
I built a small Linux desktop app around Whisper for voice input into different applications, including terminal apps:
https://github.com/phplego/mywhisper
The workflow is simple:
Left Ctrlto start and stop recordingEsccancels the current recordingThis is not an offline tool and it currently depends on an OpenAI API key set in the app settings.
I am sharing it here mostly because Ubuntu/X11 is the environment where I have actually used it.
The main thing I am trying to understand is how this kind of tool should work on Wayland.
On X11, global hotkeys and sending text to the active application are manageable.
On Wayland, the expected and acceptable way to do that is much less obvious.
If anyone here has experience shipping or using dictation tools on Ubuntu Wayland, I would be interested in pointers on:
I realize there are more mature dictation tools already, including local ones.
I am sharing this because the narrow use case may still be useful to someone, not because I think it replaces those projects.
Thank you in advance!
Beta Was this translation helpful? Give feedback.
All reactions