metadata
license: apache-2.0
JoyAI-VL-Interaction
The first open, vision-driven real-time interaction model — it watches a live video stream and decides on its own when to speak, stay silent, or delegate.
📄 Paper · 🌐 Project Page & Demos · 💻 GitHub · 🤗 Paper Page