Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 890
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 8 items • Updated 29 days ago • 23
🛰️🌍 Geospatial Datasets Collection A curated collection of diverse geospatial and satellite imagery datasets. • 52 items • Updated Mar 2 • 32
OlmoEarth Collection OlmoEarth pre-trained and fine-tuned foundation models for remote sensing • 10 items • Updated Dec 23, 2025 • 18
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 45 items • Updated Mar 2 • 109
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Mar 2 • 79
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces Paper • 2506.00123 • Published May 30, 2025 • 35
Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5, 2025 • 56
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality +2 saurabhdash, olivernan, ArashAhmadian, johndang-cohere • Mar 4, 2025 • 78