Minimal Conversation (S2S backend, WebSocket) 🎙 Voice chat over WebSocket against a HF speech-to-speech
Encoder-Free VLM (DenseFusion-1M + ShareGPT4V) 🖼 Encoder-free VLM (Qwen3-1.7B) on DenseFusion-1M + ShareGPT4V
Article Eyes, ears, and a voice: building Reachy Mini's media stack pollen-robotics • 13 days ago • 20
Article Welcome Gemma 4: Frontier multimodal intelligence on device merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 909