Running on Zero Agents 34 XL Model Experiments π 34 Generate images from text prompts using Stable Diffusion XL
Running Agents 153 Qwen3.5 Omni Offline Demo π 153 Chat with a multimodal AI using text, audio, images or video
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper β’ 2512.10942 β’ Published Dec 11, 2025 β’ 61