# Awesome Whisper Apps

A curated collection of applications, tools, and resources built with [OpenAI Whisper](https://github.com/openai/whisper) - a robust automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data.

## Table of Contents

- [Quick Start Guide](#quick-start-guide)
- [Popular Picks](#popular-picks)
- [Getting Started](#getting-started)
- [By Use Case](#by-use-case)
- [By Platform](#by-platform)
- [For Developers](#for-developers)
- [Resources](#resources)

## Quick Start Guide

**Looking for something specific?**

- **Voice typing on Linux?** → [Linux System Integration](#linux-system-integration) or try [nerd-dictation](#nerd-dictation)
- **Voice typing on Mac?** → [macOS Apps](#macos) or try [SuperWhisper](#superwhisper)
- **Voice typing on Windows?** → [Windows Apps](#windows) or try [WinWhisper](#winwhisper)
- **Cross-platform desktop app?** → Try [Buzz](#buzz) or [whisper-writer](#whisper-writer)
- **Generate video subtitles?** → [Subtitles & Captioning](#srt--subtitles--captioning)
- **Real-time transcription?** → [Real-Time & Streaming](#real-time--streaming)
- **Meeting transcription?** → [Meeting & Productivity](#meeting--productivity)
- **Cloud/SaaS solution?** → [SaaS Platforms](#saas-platforms)
- **Self-hosted web interface?** → [Web UI](#web-ui)
- **Mobile app?** → [Android](#android) or [iOS](#ios)
- **Developer integration?** → [Libraries & APIs](#libraries--apis) or [Model Variants](#model-variants)

## Popular Picks

Top projects by community engagement and activity:

### Desktop Applications

| Project | Platform | Description |
|---------|----------|-------------|
| [Buzz](#buzz) | Cross-platform | Feature-rich desktop transcription app |
| [whisper-writer](#whisper-writer) | Cross-platform | Voice-to-text for system-wide input |
| [SuperWhisper](#superwhisper) | macOS | Premium Mac app for voice-to-text |
| [WinWhisper](#winwhisper) | Windows | System-wide hotkey support for Windows |

### Model Variants & Performance

| Project | Description |
|---------|-------------|
| [whisper.cpp](#whispercpp) | High-performance C/C++ implementation |
| [faster-whisper](#faster-whisper) | Faster implementation using CTranslate2 |
| [WhisperX](#whisperx) | Word-level timestamps + speaker diarization |
| [insanely-fast-whisper](#insanely-fast-whisper) | Speed-optimized implementation |

### Developer Tools

| Project | Description |
|---------|-------------|
| [WhisperLive](#whisperlive) | Real-time transcription server |
| [whisper_streaming](#whisper_streaming) | Long-form streaming transcription |
| [Whisper-WebUI](#whisper-webui) | Self-hosted web interface |

## Getting Started

### Official Whisper & Models

- **Official Repository:** [openai/whisper](https://github.com/openai/whisper)
- **Hugging Face Collection:** [Whisper Model Release](https://huggingface.co/collections/openai/whisper-release)
- **Official Paper:** [Robust Speech Recognition via Large-Scale Weak Supervision](https://arxiv.org/abs/2212.04356)

### Official Model Sizes

Choose based on your accuracy/speed requirements:

| Model | Parameters | English-only | Multilingual | Relative Speed | Use Case |
|-------|------------|--------------|--------------|----------------|----------|
| [tiny](https://huggingface.co/openai/whisper-tiny) | 39M | ✓ | ✓ | Fastest | Minimal resource usage, real-time apps |
| [base](https://huggingface.co/openai/whisper-base) | 74M | ✓ | ✓ | Very Fast | Resource-constrained environments |
| [small](https://huggingface.co/openai/whisper-small) | 244M | ✓ | ✓ | Fast | Good balance for most use cases |
| [medium](https://huggingface.co/openai/whisper-medium) | 769M | ✓ | ✓ | Moderate | Better accuracy, moderate speed |
| [large](https://huggingface.co/openai/whisper-large) | 1550M | - | ✓ | Slower | Best accuracy, research use |
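
As a rough illustration of choosing from the table, the sketch below picks the largest model that fits a parameter budget. The helper name `pick_model` and the budget-based rule are assumptions made for this example, not part of Whisper itself; the sizes are copied from the table, and the `.en` suffix follows the naming of the official English-only checkpoints.

```python
# Parameter counts in millions, copied from the model size table.
MODEL_PARAMS_M = {"tiny": 39, "base": 74, "small": 244, "medium": 769, "large": 1550}

# Sizes that also ship as English-only checkpoints ("large" is multilingual only).
ENGLISH_ONLY = {"tiny", "base", "small", "medium"}


def pick_model(max_params_m: int, english_only: bool = False) -> str:
    """Return the largest model whose parameter count fits the budget.

    Hypothetical helper for illustration; raises ValueError if nothing fits.
    """
    candidates = [name for name, size in MODEL_PARAMS_M.items() if size <= max_params_m]
    if english_only:
        candidates = [name for name in candidates if name in ENGLISH_ONLY]
    if not candidates:
        raise ValueError(f"no model fits within {max_params_m}M parameters")
    best = max(candidates, key=MODEL_PARAMS_M.__getitem__)
    return f"{best}.en" if english_only else best


if __name__ == "__main__":
    print(pick_model(300))                      # small
    print(pick_model(2000, english_only=True))  # medium.en
```

With the official `openai-whisper` package installed, the returned name could then be passed to `whisper.load_model(...)`, and the loaded model's `transcribe(...)` method called on an audio file path.
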

---

## By Use Case

### Voice Typing & Dictation

**Cross-Platform:**

- [Buzz](#buzz) - Feature-rich desktop app
- [whisper-writer](#whisper-writer) - System-wide voice-to-text
- [whisper-dictation](#whisper-dictation) - Dictation application

**Linux:**

- [nerd-dictation](#nerd-dictation) - Hackable offline speech-to-text
- [BlahST](#blahst) - Linux speech-to-text integration
- [whisper-to-input](#whisper-to-input) - Convert transcription to keyboard input
- [voice-typing-linux](#voice-typing-linux) - Voice typing integration

**macOS:**

- [SuperWhisper](#superwhisper) - Premium Mac voice-to-text app
- [OpenSuperWhisper](#opensuperwhisper) - Open-source Mac app
- [WhisperKit](#whisperkit) - Native macOS implementation

**Windows:**

- [WinWhisper](#winwhisper) - System-wide hotkey support
- [Whisper Typing for Windows](#whisper-typing-for-windows) - Desktop voice typing

**Mobile:**

- [whisperIME](#whisperime) (Android) - Input method editor
- [Whisperboard](#whisperboard) (iOS) - Keyboard with Whisper

### SaaS Platforms & Cloud Services

- [Whisper Transcribe](https://www.whispertranscribe.com/) - Online transcription platform
- [WhisperAI](https://whisperai.com) - Cloud-based transcription service
- [Whisper Typing](https://whispertyping.com/) - Online typing and transcription
- [Wisprflow](https://wisprflow.ai/) - Workflow automation with transcription
- [CleverType](https://www.clevertype.co/) - Smart typing assistant
- [SpeechPulse](https://speechpulse.com/) - Cross-platform speech-to-text
- [Blabby.ai](https://www.blabby.ai/) - Browser-based transcription

### Subtitles & Captioning

Generate subtitles and captions for videos:

- **[auto-subs](https://github.com/tmoroney/auto-subs)** - Automatic subtitle generation
- **[TeroSubtitler](https://github.com/URUWorks/TeroSubtitler)** - Professional subtitle editor
- **[whisper-youtube](https://github.com/ArthurFDLR/whisper-youtube)** - YouTube subtitle generation
- **[yt-whisper](https://github.com/m1guelpf/yt-whisper)** - YouTube transcription tool
- **[whisper-subs](https://github.com/GhostNaN/whisper-subs)** - CLI for adding subtitles to videos
- **[whisply](https://github.com/tsmdt/whisply)** - Automatic subtitle generation (Linux)
- **[template-tiktok](https://github.com/remotion-dev/template-tiktok)** - TikTok-style captioning with Remotion
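
To give a sense of what subtitle tools like these do under the hood, here is a minimal sketch that turns timed segments into SRT text. It assumes segments shaped like the `"segments"` entries returned by the official `openai-whisper` package's `transcribe` (dicts with `start`, `end`, and `text`); the function names are made up for this example and do not come from any of the listed projects.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as HH:MM:SS,mmm, the form used in SRT files."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Render an iterable of {"start", "end", "text"} dicts as SRT text.

    Each cue is numbered from 1 and separated from the next by a blank line.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    # Hand-written sample segments standing in for real transcription output.
    demo = [
        {"start": 0.0, "end": 2.5, "text": " Hello there."},
        {"start": 2.5, "end": 5.0, "text": " Welcome to the demo."},
    ]
    print(segments_to_srt(demo))
```

Real subtitle tools typically add line wrapping, maximum cue durations, and other output formats such as VTT on top of this basic conversion.
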

### Meeting & Productivity

Tools for transcribing meetings and generating notes:

- **[meeting-minutes](https://github.com/Zackriya-Solutions/meeting-minutes)** - Generate meeting minutes
- **[ScribeWizard](https://github.com/Bklieger/ScribeWizard)** - AI-powered note-taking

### Web Interfaces

**Self-Hosted:**

- **[Whisper-WebUI](https://github.com/jhj0517/Whisper-WebUI)** - Web interface for transcription
- **[NeuroSandboxWebUI](https://github.com/Dartvauder/NeuroSandboxWebUI)** - Comprehensive web UI for AI models

---

## By Platform

### Cross-Platform Desktop Applications

Applications that work on Linux, macOS, and Windows:

| Project | Description |
|---------|-------------|
| [Buzz](https://github.com/chidiwilliams/buzz) | Feature-rich transcription app |
| [whisper-writer](https://github.com/savbell/whisper-writer) | Voice-to-text application |
| [faster-whisper-GUI](https://github.com/CheshireCC/faster-whisper-GUI) | GUI for faster-whisper |
| [SoftWhisper](https://github.com/NullMagic2/SoftWhisper) | User-friendly GUI |
| [speech-assistant](https://github.com/Mohamad-Hussein/speech-assistant) | Speech assistant GUI |
| [whisper-dictation](https://github.com/foges/whisper-dictation) | Dictation application |
| [whisper-realtime-gui](https://github.com/phongthanhbuiit/whisper-realtime-gui) | Real-time transcription GUI |
| [whisper-ui](https://github.com/schnoddelbotz/whisper-ui) | Cross-platform desktop UI |
| [whisper_dictation](https://github.com/themanyone/whisper_dictation) | Voice dictation tool |
| [WhisperGUI](https://github.com/ADT109119/WhisperGUI) | Simple GUI |

### Linux

#### Desktop Applications

- **[froshine](https://github.com/AdrianScott/froshine)** - Linux desktop app
- **[speak-to-ai](https://github.com/AshBuk/speak-to-ai)** - Voice interaction app
- **[Whisper-Notepad-For-Linux](https://github.com/danielrosehill/Whisper-Notepad-For-Linux)** - Notepad-style transcription
- **[WhisperNow](https://github.com/shinglyu/WhisperNow)** - Desktop application

#### CLI Tools

- **[whisper.cpp-cli](https://github.com/charliermarsh/whisper.cpp-cli)** - CLI for whisper.cpp
- **[blurt](https://github.com/QuantiusBenignus/blurt)** - Command-line transcription tool

#### System Integration

- **[nerd-dictation](https://github.com/ideasman42/nerd-dictation)** - Hackable offline STT (VOSK-API)
- **[BlahST](https://github.com/QuantiusBenignus/BlahST)** - Speech-to-text integration
- **[Linux-Dictation-Project](https://github.com/wheeler01/Linux-Dictation-Project)** - Dictation system
- **[linux-stt-input](https://github.com/fengwk/linux-stt-input)** - STT input method
- **[linux-voice-to-text-ai](https://github.com/trebormc/linux-voice-to-text-ai)** - Voice-to-text AI
- **[LinuxWhisper](https://github.com/vitali87/LinuxWhisper)** - Linux implementation
- **[voice-typing-linux](https://github.com/GitJuhb/voice-typing-linux)** - Voice typing integration
- **[Whisper-Dictation](https://github.com/LumenYoung/Whisper-Dictation)** - Dictation system
- **[whisper-flow-linux](https://github.com/sapountzis/whisper-flow-linux)** - Workflow integration
- **[whisper-hotkey-linux](https://github.com/atkvishnu/whisper-hotkey-linux)** - Hotkey-based integration
- **[whispertrigger](https://github.com/RetroTrigger/whispertrigger)** - System integration
- **[whisprd](https://github.com/AgenticToaster/whisprd)** - Whisper daemon
- **[whisper-to-input](https://github.com/j3soon/whisper-to-input)** - Transcription to keyboard input
- **[whispy](https://github.com/daaku/whispy)** - Integration tool
- **[dicti](https://github.com/tksimson/dicti)** - Dictation tool
- **[sonori](https://github.com/0xPD33/sonori)** - Voice input system
- **[hushnote](https://github.com/peteonrails/hushnote)** - Private note-taking
- **[Local-Voice](https://github.com/shashank2122/Local-Voice)** - Local voice processing
- **[s2t](https://github.com/franchesoni/s2t)** - Speech-to-text
- **[Whisper-Notepad-Simple](https://github.com/danielrosehill/Whisper-Notepad-Simple)** - Simple notepad app
- **[Linux-AI-Assistant-scripts](https://github.com/samoylenkodmitry/Linux-AI-Assistant-scripts)** - AI assistant scripts

### macOS

#### Desktop Applications

- **[SuperWhisper](https://superwhisper.com/)** - Premium Mac voice-to-text app
- **[OpenSuperWhisper](https://github.com/Starmel/OpenSuperWhisper)** - Open-source Mac app
- **[WhisperKit](https://github.com/argmaxinc/WhisperKit)** - Native macOS implementation
- **[Careless Whisper](https://carelesswhisper.app/)** - Lightweight transcription app

#### System Integration

- **[ollama-voice-mac](https://github.com/apeatling/ollama-voice-mac)** - Voice interface for Ollama
- **[whisperanywhere-js](https://github.com/unclecode/whisperanywhere-js)** - System-wide transcription

---

### Windows

#### Desktop Applications

- **[AI Transcription](https://apps.microsoft.com/detail/9p7f1j2svk3g)** - Microsoft Store app
- **[Whisper Typing for Windows](https://whispertyping.com/download)** - Desktop voice typing

#### System Integration

- **[WinWhisper](https://github.com/GewoonJaap/WinWhisper)** - System-wide hotkey support

---

### Android

- **[whisperIME](https://github.com/woheller69/whisperIME)** - Input method editor
- **[WhisperInput](https://github.com/alex-vt/WhisperInput)** - Input app
- **[WhisperKitAndroid](https://github.com/argmaxinc/WhisperKitAndroid)** - WhisperKit for Android
- **[RTranslator](https://github.com/niedev/RTranslator)** - Real-time translation app
- **[Dictate](https://github.com/DevEmperor/Dictate)** - Voice dictation app
- **[whisper_android](https://github.com/vilassn/whisper_android)** - Android integration

---

### iOS

- **[Whisperboard](https://github.com/Saik0s/Whisperboard)** - iOS keyboard with Whisper integration

---

### Embedded / Raspberry Pi

- **[Local-Voice](https://github.com/shashank2122/Local-Voice)** - Local voice processing for embedded systems

---

## For Developers

### Model Variants & Performance Optimizations

### pyannote-whisper

Integration of Whisper with pyannote for speaker diarization

**Repository:** [https://github.com/yinruiqing/pyannote-whisper](https://github.com/yinruiqing/pyannote-whisper)

---

### WhisperChain

Pipeline framework for Whisper-based workflows

**Repository:** [https://github.com/chrischoy/WhisperChain](https://github.com/chrischoy/WhisperChain)

---

## IDE & Editor Integrations

### VS Code

### Whisper Assistant

Whisper voice-to-text integration for VS Code

**Repository:** [https://marketplace.visualstudio.com/items?itemName=MartinOpenSky.whisper-assistant](https://marketplace.visualstudio.com/items?itemName=MartinOpenSky.whisper-assistant)

---

### Yap - Cursor Extension

Voice input extension for VS Code and Cursor editor

**Repository:** [https://marketplace.visualstudio.com/items?itemName=rishabhsai.yap-cursor-extension](https://marketplace.visualstudio.com/items?itemName=rishabhsai.yap-cursor-extension)

---

### WhisperX Assistant

WhisperX integration for VS Code with enhanced features

**Repository:** [https://marketplace.visualstudio.com/items?itemName=mwhesse.whisperx-assistant](https://marketplace.visualstudio.com/items?itemName=mwhesse.whisperx-assistant)

---

### Obsidian

### whisper-obsidian-plugin

Whisper integration for Obsidian note-taking app

**Repository:** [https://github.com/nikdanilov/whisper-obsidian-plugin](https://github.com/nikdanilov/whisper-obsidian-plugin)

---

### Note-Taking & Productivity

### ScribeWizard

AI-powered note-taking with Whisper transcription

**Repository:** [https://github.com/Bklieger/ScribeWizard](https://github.com/Bklieger/ScribeWizard)

---

## Game Engines & Development Platforms

### Unity

### whisper.unity

Whisper integration for Unity game engine

**Repository:** [https://github.com/Macoron/whisper.unity](https://github.com/Macoron/whisper.unity)

---

## Playgrounds & Demos

### whisper-playground

Interactive playground for experimenting with Whisper

**Repository:** [https://github.com/saharmor/whisper-playground](https://github.com/saharmor/whisper-playground)

---

## SRT / Subtitles & Captioning

### auto-subs

Automatic subtitle generation with Whisper

**Repository:** [https://github.com/tmoroney/auto-subs](https://github.com/tmoroney/auto-subs)

---

### template-tiktok

TikTok-style captioning with Whisper integration using Remotion

**Repository:** [https://github.com/remotion-dev/template-tiktok](https://github.com/remotion-dev/template-tiktok)

---

### TeroSubtitler

Professional subtitle editor with Whisper integration

**Repository:** [https://github.com/URUWorks/TeroSubtitler](https://github.com/URUWorks/TeroSubtitler)

---

### whisper-subs

CLI tool for adding subtitles to videos using Whisper

**Repository:** [https://github.com/GhostNaN/whisper-subs](https://github.com/GhostNaN/whisper-subs)

---

### whisper-youtube

Generate subtitles from YouTube videos using Whisper

**Repository:** [https://github.com/ArthurFDLR/whisper-youtube](https://github.com/ArthurFDLR/whisper-youtube)

---

### whisply

Linux tool for automatic subtitle generation

**Repository:** [https://github.com/tsmdt/whisply](https://github.com/tsmdt/whisply)

---

### yt-whisper

YouTube subtitle generation with Whisper

**Repository:** [https://github.com/m1guelpf/yt-whisper](https://github.com/m1guelpf/yt-whisper)

---

## Deployment & Containers

### cog-whisper

Cog container for deploying Whisper models

**Repository:** [https://github.com/replicate/cog-whisper](https://github.com/replicate/cog-whisper)

---

## Meeting & Productivity

### meeting-minutes

Generate meeting minutes using Whisper transcription

**Repository:** [https://github.com/Zackriya-Solutions/meeting-minutes](https://github.com/Zackriya-Solutions/meeting-minutes)

---

## Miscellaneous

### whisper-turbo

High-performance Whisper implementation

**Repository:** [https://github.com/FL33TW00D/whisper-turbo](https://github.com/FL33TW00D/whisper-turbo)

---

## Resources

### Official

- [OpenAI Whisper Paper](https://arxiv.org/abs/2212.04356)

### Tutorials & Guides

### Model Variants

### CrisperWhisper

Enhanced Whisper variant for improved accuracy

**Repository:** [https://github.com/nyrahealth/CrisperWhisper](https://github.com/nyrahealth/CrisperWhisper)

---

### distil-whisper

Distilled Whisper models from Hugging Face

**Repository:** [https://github.com/huggingface/distil-whisper](https://github.com/huggingface/distil-whisper)

---

### faster-whisper

Faster Whisper implementation using CTranslate2

**Repository:** [https://github.com/SYSTRAN/faster-whisper](https://github.com/SYSTRAN/faster-whisper)

---

### insanely-fast-whisper

Optimized Whisper implementation for speed

**Repository:** [https://github.com/Vaibhavs10/insanely-fast-whisper](https://github.com/Vaibhavs10/insanely-fast-whisper)

---

### whisper.cpp

High-performance C/C++ implementation

**Repository:** [https://github.com/ggerganov/whisper.cpp](https://github.com/ggerganov/whisper.cpp)

---

### whisper.net

.NET implementation of Whisper

**Repository:** [https://github.com/sandrohanea/whisper.net](https://github.com/sandrohanea/whisper.net)

---

### WhisperX

Whisper with word-level timestamps and speaker diarization

**Repository:** [https://github.com/m-bain/whisperX](https://github.com/m-bain/whisperX)

---

### Real-Time & Streaming

### whisper-flow

Real-time Whisper transcription flow

**Repository:** [https://github.com/dimastatz/whisper-flow](https://github.com/dimastatz/whisper-flow)

---

### whisper_real_time

Real-time Whisper transcription implementation

**Repository:** [https://github.com/davabase/whisper_real_time](https://github.com/davabase/whisper_real_time)

---

### whisper_streaming

Whisper for long-form streaming transcription

**Repository:** [https://github.com/ufal/whisper_streaming](https://github.com/ufal/whisper_streaming)

---

### WhisperLive

Real-time transcription using Whisper

**Repository:** [https://github.com/collabora/WhisperLive](https://github.com/collabora/WhisperLive)

---

### Diarization & Timestamps

### cog-whisper-diarization

Cog-wrapped Whisper with diarization

**Repository:** [https://github.com/thomasmol/cog-whisper-diarization](https://github.com/thomasmol/cog-whisper-diarization)

---

### whisper-diarization

Whisper with speaker diarization

**Repository:** [https://github.com/MahmoudAshraf97/whisper-diarization](https://github.com/MahmoudAshraf97/whisper-diarization)

---

### whisper-timestamped

Word-level timestamps for Whisper

**Repository:** [https://github.com/linto-ai/whisper-timestamped](https://github.com/linto-ai/whisper-timestamped)

---

### WhisperTimeSync

Time synchronization and diarization for Whisper

**Repository:** [https://github.com/EtienneAb3d/WhisperTimeSync](https://github.com/EtienneAb3d/WhisperTimeSync)

---

### Fine-Tuning

### Whisper-Finetune

Utilities for fine-tuning Whisper models

**Repository:** [https://github.com/yeyupiaoling/Whisper-Finetune](https://github.com/yeyupiaoling/Whisper-Finetune)

---

### whisper-finetuning

Fine-tuning framework for Whisper models

**Repository:** [https://github.com/jumon/whisper-finetuning](https://github.com/jumon/whisper-finetuning)

---

### Awesome Lists

- [awesome-openai-whisper (ancs21)](https://github.com/ancs21/awesome-openai-whisper)
- [awesome-whisper (saharmor)](https://github.com/saharmor/awesome-whisper)
- [awesome-whisper (oenu)](https://github.com/oenu/awesome-whisper)