
Whisper Typer

Push-to-talk voice transcription using Faster-Whisper. Supports Windows, macOS, and Linux.

Quick Start

  1. Start the app:

    uv run run.py
  2. In the app:

    • The server auto-starts on launch.
    • Choose a Model and Input Mode (Live or Full Capture).
    • Use the Global Hotkey: Ctrl+Win (Windows) or Ctrl+Cmd (macOS).
    • Hold keys to record, release to stop and transcribe.
    • Quick Double-Tap to enter "Hands-free" mode (press again to stop).
    • Text types into your active window automatically.

Installation

If you want to install it as a global tool:

uv pip install -e .
whisper-typer

Flow logic

%%{init: {"flowchart": {"htmlLabels": false}} }%%
flowchart TD
    A["User Hotkey"] --> B["Audio Input Stream"]
    B --> C{"Input Mode"}
    C -->|Live typing| D["Silence-based Chunking"]
    C -->|Full Capture| E["Full Recording Capture"]
    D --> F["Transcription Queue (FIFO)"]

    E --> F
    F --> G["Server API (Transcribe)"]
    G --> H["Transcription Service"]
    H --> I["Text Output"]
    I --> J["Keyboard Typing to Active Window"]
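The flow above can be sketched as a minimal Python pipeline. This is an illustration only, not the project's actual code: `transcribe` and `type_out` stand in for the server API call and the keyboard simulation.

```python
import queue
import threading

def run_pipeline(chunks, transcribe, type_out):
    """Push audio chunks through a FIFO queue; a single worker
    transcribes each chunk in order and emits the text."""
    q = queue.Queue()

    def worker():
        while True:
            chunk = q.get()
            if chunk is None:          # sentinel: no more chunks
                break
            text = transcribe(chunk)   # stand-in for the server API call
            type_out(text)             # stand-in for keyboard typing
            q.task_done()

    t = threading.Thread(target=worker, daemon=True)
    t.start()
    for c in chunks:
        q.put(c)
    q.put(None)
    t.join()

# Example with stand-ins for the server and the keyboard:
typed = []
run_pipeline([b"hello", b"world"],
             transcribe=lambda c: c.decode(),
             type_out=typed.append)
```

A single worker thread draining a `queue.Queue` is the simplest way to guarantee FIFO output order even when chunks arrive faster than they can be transcribed.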
  • User triggers hotkey (Ctrl+Win or Ctrl+Cmd).
  • Audio is captured from input stream.
  • App checks selected mode:
    • Live typing → chunks split by silence windows and enqueued.
    • Full Capture → all chunks captured until stop, then enqueued.
  • Queue processes each chunk in order (FIFO).
  • For each chunk:
    • Send audio to server via API.
    • Server returns transcribed text.
    • Text is typed into the active window via keyboard simulation.
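Live mode's silence-based chunking can be illustrated with a simple energy threshold: a chunk is cut wherever enough consecutive low-energy frames occur. This is a hedged sketch; the actual detector and its parameters may differ.

```python
def split_on_silence(frames, threshold, min_silence=2):
    """Split per-frame energy values into chunks, cutting wherever
    `min_silence` consecutive frames fall below `threshold`.
    Returns the frame indices belonging to each chunk."""
    chunks, current, quiet = [], [], 0
    for i, energy in enumerate(frames):
        if energy < threshold:
            quiet += 1
            if quiet >= min_silence and current:
                chunks.append(current)  # silence window closes a chunk
                current = []
        else:
            quiet = 0
            current.append(i)
    if current:
        chunks.append(current)          # flush the trailing chunk
    return chunks

# Loud frames, a two-frame silence gap, then more loud frames:
print(split_on_silence([9, 8, 9, 0, 0, 9, 8], threshold=1))
# → [[0, 1, 2], [5, 6]]
```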

Hotkeys & Auto-typing

The client runs a global low-level hotkey listener:

  • Ctrl+Win (Windows) or Ctrl+Cmd (macOS).
  • Hold to Record: Recording stays active as long as keys are held. Releasing either key stops and triggers transcription.
  • Hands-free (Toggle): Double-tap the combo quickly to stay in recording mode after release. Tap again to stop.
  • When recording is stopped, the client waits for the transcription and then simulates keyboard typing to insert the text into the currently focused window.
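The double-tap toggle boils down to comparing press timestamps. Here is a minimal sketch, assuming a 0.4 s double-tap window (the real threshold is not documented here):

```python
DOUBLE_TAP_WINDOW = 0.4  # seconds; assumed value for illustration

class TapDetector:
    """Detect a quick double-tap of the hotkey combo."""

    def __init__(self, window=DOUBLE_TAP_WINDOW):
        self.window = window
        self.last_press = None

    def press(self, now):
        """Register a press; return True when it completes a double-tap."""
        is_double = (self.last_press is not None
                     and now - self.last_press <= self.window)
        self.last_press = now
        return is_double

d = TapDetector()
print(d.press(0.0))   # first tap → False
print(d.press(0.2))   # within window → True (enter hands-free)
print(d.press(5.0))   # too late → False
```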

macOS Users:

  1. You must grant Accessibility permissions to your terminal (e.g., iTerm or Terminal.app) for the auto-typing to work.
  2. Grant Microphone permissions when prompted.

System tray icon colors

| State                | Color     | Meaning                                |
| -------------------- | --------- | -------------------------------------- |
| Idle (server online) | 🟢 Green  | Server is running, ready to transcribe |
| Server offline       | ⚫ Black  | Server is not reachable                |
| Recording            | 🔴 Red    | Audio is being captured                |
| Processing           | 🟣 Purple | Transcribing audio                     |

Requirements

  • OS: Windows, macOS, or Linux
  • Python: 3.10+
  • Package manager: uv (recommended)
  • Docker: Optional, for isolated container deployment

Configuration

The application stores data in ~/.whisper-typer/ by default. You can customize settings using a .env file in the project root:

  • WHISPER_MODEL: Default model (e.g., tiny, small, medium).
  • WHISPER_MODELS_DIR: Custom path for model storage. Use an absolute path (for example D:/AI/whisper-models on Windows or /absolute/path/to/models on Linux/macOS) so the client and server always use the same directory.
  • HF_TOKEN: Hugging Face token for private models.
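For example, a `.env` in the project root might look like this (values are illustrative; `hf_xxx` is a placeholder, not a real token):

```shell
WHISPER_MODEL=small
WHISPER_MODELS_DIR=/absolute/path/to/models
HF_TOKEN=hf_xxx
```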

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for details.


License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.


About the Author

Sharad Raj Singh Maurya
AI Engineer and Open Source enthusiast.

Feel free to reach out for collaborations or to report any issues!
