VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 247
Running on Zero Agents Featured 474 Parakeet-TDT-0.6b-V2 474 Transcribe audio files with timestamps and downloadable subtitles
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook 📚 3.22k The secrets to building world-class LLMs
Running Featured 93 Parakeet STT Progressive Transcription 🎤 93 Transcribe speech to text instantly with WebGPU acceleration
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.72M • • 3.11k
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations Paper • 2108.01073 • Published Aug 2, 2021 • 9
Running on Zero Agents Featured 140 Qwen3-ASR Demo 🎙 140 Transcribe audio to text with timestamps and visualization
Running on CPU Upgrade Agents 2.01k Omni Image Editor 🖼 2.01k Image edit, text to image, image upscale, remove watermark
Running on Zero Agents Featured 1.99k Qwen3-TTS Demo 🎙 1.99k Generate speech from text using voice design, cloning or presets