WhisperNow: Voice Transcription Tool (Linux)

A voice transcription tool using faster-whisper that records audio and converts speech to text on Linux systems.

Features

Real-time audio recording using the sox utility
Speech-to-text transcription using faster-whisper
Automatic clipboard copy of transcribed text (using wl-copy for Wayland)
Voice activity detection (VAD) to filter silence
Support for different Whisper models (small.en, large-v3, distil-medium.en)

System Requirements

Linux operating system
Python >= 3.12

sox for audio recording:

# Ubuntu/Debian
sudo apt install sox
# Fedora
sudo dnf install sox

wl-clipboard for Wayland clipboard support:

# Ubuntu/Debian
sudo apt install wl-clipboard
# Fedora
sudo dnf install wl-clipboard

uv package manager
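
As a quick sanity check before the first run, the requirements above can be verified from Python. This is a minimal sketch and not part of the repository; `REQUIRED_TOOLS` and `missing_tools` are hypothetical names, and `wl-copy` is assumed to be the binary provided by the wl-clipboard package.

```python
import shutil

# Command-line tools named in the requirements above.
# "wl-copy" is assumed to be the binary installed by wl-clipboard.
REQUIRED_TOOLS = ["sox", "wl-copy", "uv"]


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the entries of `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing tools:", ", ".join(missing))
    else:
        print("All required tools found.")
```

If anything is reported missing, install it with the apt or dnf commands listed above.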

Installation

Install uv

curl -LsSf https://astral.sh/uv/install.sh | sh

Install dependencies

# Ubuntu
sudo apt install sox wl-clipboard python3

Clone this repository

Usage

Run the script directly (via uv):

OMP_NUM_THREADS=2 uv run transcribe.py

Or use the terminal launcher, which opens a terminal and runs the script inside it. This is useful for binding to a sway hotkey.

./run_in_terminal.sh

Usage Instructions

The program starts recording automatically
Press Enter to stop recording
Wait for the transcription to complete
The transcribed text is copied to the clipboard automatically
Press Enter to record another message, or type q and press Enter to quit
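
The steps above can be sketched as a small loop. This is an illustration under stated assumptions, not the code in transcribe.py: the function names, the file naming scheme, and the exact sox invocation (`sox -d <file>`, which records from the default input device) are all hypothetical, and the `transcribe` and `copy_to_clipboard` callables are supplied by the caller.

```python
import subprocess
from pathlib import Path

RECORDINGS_DIR = Path("/tmp/recordings")  # directory mentioned in the Notes


def record_command(wav_path):
    """Build a sox command that records from the default input device (-d)."""
    return ["sox", "-d", str(wav_path)]


def wants_quit(answer):
    """Return True when the user typed q before pressing Enter."""
    return answer.strip().lower() == "q"


def record_until_enter(wav_path, wait=input):
    """Start sox, block until Enter is pressed, then stop the recording."""
    wav_path.parent.mkdir(parents=True, exist_ok=True)
    recorder = subprocess.Popen(record_command(wav_path))
    try:
        wait("Recording... press Enter to stop. ")
    finally:
        recorder.terminate()  # ask sox to stop (sends SIGTERM)
        recorder.wait()


def main_loop(transcribe, copy_to_clipboard):
    """Record, transcribe, copy, and repeat until the user quits.

    transcribe(path) -> str and copy_to_clipboard(text) are hypothetical
    callables passed in by the caller.
    """
    count = 0
    while True:
        wav_path = RECORDINGS_DIR / f"recording_{count}.wav"
        record_until_enter(wav_path)
        text = transcribe(wav_path)
        copy_to_clipboard(text)
        print(text)
        if wants_quit(input("Enter to record again, q + Enter to quit: ")):
            break
        count += 1
```

Passing the transcription and clipboard steps in as functions keeps the loop itself independent of faster-whisper and of the clipboard tool in use.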

Configuration

You can change the model size in transcribe.py:

# Available options:
model_size = "small.en"     # Faster, less accurate
# model_size = "large-v3"   # Slower, more accurate
# model_size = "distil-medium.en"  # Balanced performance
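
For context, here is a rough sketch of how a `model_size` value might be handed to faster-whisper with voice activity detection enabled. It is not taken from transcribe.py: `MODEL_CHOICES`, `join_segments`, and `transcribe_file` are hypothetical names, and the `device="cpu"` and `compute_type="int8"` settings are assumptions that the real script may not share.

```python
MODEL_CHOICES = ("small.en", "large-v3", "distil-medium.en")  # options listed above


def join_segments(segments):
    """Concatenate the .text of each segment into one trimmed string."""
    return " ".join(segment.text.strip() for segment in segments).strip()


def transcribe_file(wav_path, model_size="small.en"):
    """Transcribe wav_path with faster-whisper, filtering silence via VAD."""
    if model_size not in MODEL_CHOICES:
        raise ValueError(f"unknown model size: {model_size!r}")
    # Imported lazily so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    # device and compute_type are assumptions; adjust them for your hardware.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # transcribe() returns a lazy iterator of segments plus an info object.
    segments, _info = model.transcribe(str(wav_path), vad_filter=True)
    return join_segments(segments)
```

Larger models download more data on first use and take longer per recording, so small.en is a reasonable starting point on CPU-only machines.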

Notes

This tool is designed specifically for Linux systems running Wayland
On X11 systems, change the clipboard command from wl-copy to xclip
Recorded audio files are stored temporarily in /tmp/recordings/
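
One way to handle both display servers without editing the script by hand is to pick the clipboard command at run time. The sketch below is an assumption about how this could be done, not how the tool currently works: `clipboard_command` and `copy_to_clipboard` are hypothetical helpers, and a non-empty `WAYLAND_DISPLAY` is used as the signal that a Wayland session is active.

```python
import os
import subprocess


def clipboard_command(env=None):
    """Pick wl-copy under Wayland, otherwise fall back to xclip."""
    env = os.environ if env is None else env
    if env.get("WAYLAND_DISPLAY"):
        return ["wl-copy"]
    return ["xclip", "-selection", "clipboard"]


def copy_to_clipboard(text, env=None):
    """Send text to the selected clipboard tool on its standard input."""
    subprocess.run(clipboard_command(env), input=text.encode("utf-8"), check=True)
```

On X11 this requires xclip to be installed in place of wl-clipboard.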

License

MIT
