YOLO26 Models Collection YOLO26 models: detection, segmentation, classification, pose, and OBB variants with demos and ONNX variants. • 42 items • Updated 9 days ago • 31
Density-Vs-Diversity-Blogpost Collection The collection contains the artefacts used to do the analysis for the blogpost: Diversity Vs Density: A strategy comparison for fine-tuning VLMs • 7 items • Updated 22 days ago • 2
view article Article Diversity Vs Density: A data strategy comparison for fine-tuning VLMs 23 days ago • 5
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 149
view article Article When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance Sep 30, 2025 • 12
Benchmark It Yourself (BIY): Preparing a Dataset and Benchmarking AI Models for Scatterplot-Related Tasks Paper • 2510.06071 • Published Oct 7, 2025 • 2
view article Article Introducing Command A Vision: Multimodal AI built for Business Jul 31, 2025 • 63
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9, 2025 • 772
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens Paper • 2506.17218 • Published Jun 20, 2025 • 29
VisionZip: Longer is Better but Not Necessary in Vision Language Models Paper • 2412.04467 • Published Dec 5, 2024 • 117
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 148
view article Article *Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings Jun 2, 2025 • 27
Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding Paper • 2502.11492 • Published Feb 17, 2025 • 2
ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models Paper • 2505.13444 • Published May 19, 2025 • 17