Fast Whisper WebUI NEW Update
huggingface.co spaces
Fast Whisper WebUI https://huggingface.co/spaces/gobeldan/Fast-Whisper-Small-Webui
Clone repository
git clone https://huggingface.co/spaces/gobeldan/Fast-Whisper-Small-Webui
cd Fast-Whisper-Small-Webui
Before you clone or download this code, please note that the code I have released on GitHub comes from the Hugging Face Space linked above. It is published on GitHub as is, even if it does not work on your machine. To install it, download the code first. If you get stuck, give ChatGPT the file details listed on this page and ask it, "Tell me how to install it." The installation steps are also explained further down this page.
CPU / GPU Requirements (VIP Info)
Recommended models
CPU (small): 461 MB
CPU/GPU (medium): 1.42 GB
GPU with 4 GB VRAM (Systran/faster-whisper-large-v1): 3.09 GB
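The table above can be read as a simple selection rule. The sketch below is only an illustration of that rule, not code from the app: the function name `pick_model` and the 4 GB threshold logic are assumptions based on the three entries listed.

```python
from typing import Optional

# Model names and download sizes as listed above.
# The selection rule below is an assumption made for illustration.
MODEL_SIZES = {
    "small": "461 MB",
    "medium": "1.42 GB",
    "Systran/faster-whisper-large-v1": "3.09 GB",
}


def pick_model(vram_gb: Optional[float]) -> str:
    """Return a model name for the given GPU memory (None means CPU only)."""
    if vram_gb is None:
        # No GPU: the smallest model is the safest choice on CPU.
        return "small"
    if vram_gb >= 4:
        # 4 GB of VRAM or more: the large-v1 model is listed for this case.
        return "Systran/faster-whisper-large-v1"
    # A GPU with less than 4 GB: fall back to the medium model.
    return "medium"


if __name__ == "__main__":
    for vram in (None, 2, 4):
        name = pick_model(vram)
        print(vram, "->", name, f"({MODEL_SIZES[name]})")
```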
Fast Whisper WebUI
A total of 107 languages are listed:
English, Urdu, Hindi, Afrikaans, Albanian, Amharic, Arabic, Armenian, Assamese, Azerbaijani, Bashkir, Basque, Belarusian, Bengali, Bosnian, Breton, Bulgarian, Burmese, Castilian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, Estonian, Faroese, Finnish, Flemish, French, Galician, Georgian, German, Greek, Gujarati, Haitian, Haitian Creole, Hausa, Hawaiian, Hebrew, Hungarian, Icelandic, Indonesian, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Lao, Latin, Latvian, Letzeburgesch, Lingala, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Mandarin, Maori, Marathi, Moldavian, Moldovan, Mongolian, Myanmar, Nepali, Norwegian, Nynorsk, Occitan, Panjabi, Pashto, Persian, Polish, Portuguese, Punjabi, Pushto, Romanian, Russian, Sanskrit, Serbian, Shona, Sindhi, Sinhala, Sinhalese, Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tagalog, Tajik, Tamil, Tatar, Telugu, Thai, Tibetan, Turkish, Turkmen, Ukrainian, Uzbek, Valencian, Vietnamese, Welsh, Yiddish, Yoruba
Audio Transcription
SRT to Text
Remove timers
Clean Text - Remove Special Characters
File Uploader
Audio Transcription & Text Processing Toolkit
What This Tool Does:
1. Audio Transcription
· Convert Speech to Text - Transcribe audio files to written text
· Multiple Input Sources - Upload files, record microphone, or use URLs
· Multi-language Support - 107 languages including Urdu, English, Arabic, Hindi
· Generate Subtitles - Create SRT files automatically
· Offline Processing - Works completely offline with local AI models
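For readers who want to see roughly what transcription to SRT involves, here is a minimal sketch. It assumes the app is built on the `faster-whisper` package (suggested by the model name Systran/faster-whisper-large-v1), and it is not the app's own code. The helper names `srt_timestamp`, `segments_to_srt`, and `transcribe_to_srt` are hypothetical.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text from an iterable of (start, end, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}\n"
        )
    # A blank line separates consecutive subtitle blocks.
    return "\n".join(blocks)


def transcribe_to_srt(audio_path: str, model_name: str = "small") -> str:
    """Transcribe an audio file on CPU and return the result as SRT text."""
    # Imported lazily so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_name, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(audio_path)
    return segments_to_srt((s.start, s.end, s.text) for s in segments)
```

Calling `transcribe_to_srt("interview.mp3")` (a placeholder file name) would download the model on first use and then run locally.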
2. Text Processing & Cleaning
· Remove Special Characters - Clean text from unwanted symbols
· Timer Removal - Extract pure text from subtitle files (remove timestamps)
· Format Conversion - Convert text to TXT, HTML, JSON, XML, CSV formats
· File Reading - Read content from various file types (PDF, SRT, DOC, etc.)
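As an illustration of the timer removal and special character cleaning described above, here is a small sketch using only the Python standard library. It is an assumption about how such steps could work, not the app's implementation. The function names and the set of punctuation that is kept are hypothetical choices.

```python
import re
import unicodedata

# Matches an SRT timing line such as "00:00:01,500 --> 00:00:03,000".
TIMESTAMP_LINE = re.compile(
    r"^\d{2}:\d{2}:\d{2}[,.]\d{3}\s*-->\s*\d{2}:\d{2}:\d{2}[,.]\d{3}"
)

# Punctuation kept during cleaning; an illustrative choice, adjust as needed.
KEEP_PUNCTUATION = set(".,!?'-")


def srt_to_text(srt: str) -> str:
    """Drop index lines, timing lines, and blank lines; join the rest."""
    kept = []
    for line in srt.splitlines():
        stripped = line.strip()
        # Note: a subtitle line made only of digits is also dropped here.
        if not stripped or stripped.isdigit() or TIMESTAMP_LINE.match(stripped):
            continue
        kept.append(stripped)
    return " ".join(kept)


def remove_special_characters(text: str) -> str:
    """Keep letters, combining marks, digits, whitespace, and basic punctuation."""
    kept = [
        ch
        for ch in text
        if ch.isspace()
        or ch in KEEP_PUNCTUATION
        or unicodedata.category(ch)[0] in "LMN"
    ]
    # Collapse runs of whitespace left behind by removed symbols.
    return " ".join("".join(kept).split())
```

Filtering by Unicode category rather than by an ASCII-only pattern keeps letters and vowel marks in scripts such as Urdu, Arabic, and Hindi.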
3. File Management
· Automatic Organization - Saves all outputs in organized folders
· Batch Processing - Handle multiple files at once
· Download Ready - Instant download of processed files
Key Features:
· Accurate Transcription - AI-powered speech recognition
· Multi-format Support - Works with audio, video, and text files
· Fast Processing - Local processing for quick results
· Smart Storage - Automatic file organization
· Text Tools - Complete text cleaning and conversion toolkit
Perfect For:
· Content Creators - Transcribe podcasts, videos, interviews
· Students & Researchers - Convert lectures and research materials
· Business Professionals - Meeting transcriptions and document processing
· Writers & Translators - Text cleaning and format conversion
· Anyone needing to convert speech to text or process text files efficiently!
"Your All-in-One Solution for Audio Transcription and Text Processing - Fast, Accurate, and Completely Offline!"
Want to talk or ask something?
Just click the YouTube link below! You'll find my email there and can message me easily.
YouTube Channel: @nzg73
https://youtube.com/@NZG73
Contact email: nzgnzg73@gmail.com
1. Clone the Repository
Use Python 3.10.11. Open your terminal or command prompt and clone the repository:
git clone https://github.com/nzgnzg73/Fast-Whisper-Small-Webui.git
cd Fast-Whisper-Small-Webui
2. Set Up a Python Virtual Environment
Create a virtual environment with Python 3.10 to avoid dependency conflicts (the py launcher used here is available on Windows):
py -3.10 -m venv venv
3. Activate the virtual environment.
venv\Scripts\activate
Install PyTorch for your hardware. GPU (NVIDIA, CUDA 12.1 build):
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
CPU:
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
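After installing, it can help to confirm which PyTorch build the virtual environment actually picked up. The following is a minimal check, not part of the project. The function name `torch_status` is hypothetical, and it only uses standard `torch.cuda` calls.

```python
import importlib.util


def torch_status() -> str:
    """Report whether torch is installed and whether it can see a CUDA GPU."""
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed in this environment"

    import torch

    if torch.cuda.is_available():
        # Device 0 is the first visible GPU.
        name = torch.cuda.get_device_name(0)
        return f"torch {torch.__version__}: CUDA device found ({name})"
    return f"torch {torch.__version__}: no CUDA device, running on CPU"


if __name__ == "__main__":
    print(torch_status())
```

If the GPU build was installed but the output reports CPU, the installed wheel or the GPU driver is the first thing to check.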
4. Install the Project and Dependencies
Users with NVIDIA 10-series cards or AMD GPUs need to manually install the torch 2.6.0 build that matches their hardware. Everyone else can install directly from requirements.txt:
pip install -r requirements.txt
Running the Application
With your virtual environment still active, run the script:
python app.py