mistralai/Voxtral-Mini-4B-Realtime-2602
Automatic Speech Recognition β’ 4B β’ Updated β’ 1.16M β’ 840
Upgraded to v1.0!
https://huggingface.co/papers/2501.03006
View and submit LLM evaluations
Gaze detection using Moondream
Audio Conditioned LipSync with Latent Diffusion Models
Describe what you want, AI writes the FFMPEG command