Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 6 days ago • 100
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published Dec 17, 2025 • 66
darknoon/svg-stack-filtered-sft-qwen2.5-vl-7b-trl-10k Image-Text-to-Text • 8B • Updated Aug 14, 2025 • 1
darknoon/svg-stack-filtered-sft-qwen2.5-vl-7b-trl-10k Image-Text-to-Text • 8B • Updated Aug 14, 2025 • 1
CohereLabs/command-a-vision-07-2025 Image-Text-to-Text • 112B • Updated Oct 30, 2025 • 41.9k • • 85