Running Featured 137 Voxtral Realtime WebGPU 💬 137 Real-time speech transcription, entirely in your browser.
Running on Zero Agents Featured 2k Qwen3-TTS Demo 🎙 2k Generate speech from text using voice design, cloning or presets
Running Agents 25 Audio To MIDI And Advanced Renderer 🎹 25 Audio to MIDI Transcription and Advanced render
MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 516k • 359
Running on Zero Agents Featured 2.58k Qwen Image Multiple Angles 3D Camera 🎥 2.58k Transform image viewpoint with adjustable camera angles