view article Article MAD GRPO: Treating Dr. GRPO that tried to fix GRPO but brought instability and verbosity bias Jan 17 • 2
Running on Zero Agents Featured 381 DLSS 5 Anything 🎮 381 Turn any image into a DLSS 5 meme (using FLUX.2-klein-9b-kv)
kotoba-tech/kotoba-whisper-v2.0 Automatic Speech Recognition • 0.8B • Updated Oct 23, 2024 • 10.8k • 90
facebook/dinov3-convnext-small-pretrain-lvd1689m Image Feature Extraction • 49.5M • Updated Aug 19, 2025 • 19.9k • 27
Configuration error Agents Featured 1.45k EasyControl Ghibli 🦀 1.45k New Ghibli EasyControl model is now released!!
Running on Zero Agents Featured 199 Chat with Kimi-VL-A3B-Thinking-2506 🤔 199 Chat with Kimi-VL: respond to text, images, video, PDFs
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 207
AppAgentX: Evolving GUI Agents as Proficient Smartphone Users Paper • 2503.02268 • Published Mar 4, 2025 • 11
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 380k • 1.6k