Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

One-click Deployment

Inference Endpoints

Microsoft Foundry

Amazon SageMaker AI

Misc

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

6,864

Base only

Active filters: multimodal

VLA-Arena/pi0-fast-vla-arena-fintuned-LoRA

Robotics • Updated Feb 25

VLA-Arena/smolvla-vla-arena

Robotics • Updated Dec 27, 2025

VLA-Arena/univla-action-decoder

Robotics • Updated Dec 27, 2025

ryanscottbarrett/braille256-v4

Text Generation • Updated Dec 21, 2025 • 6

luosaike/nanoVLM

Image-Text-to-Text • 0.2B • Updated Dec 22, 2025 • 7

anaaa2/visual-moral-compass

Image Classification • Updated Dec 22, 2025 • 4

vivienfanghua/nanoVLM

Image-Text-to-Text • 0.2B • Updated Dec 23, 2025 • 5

PengxiangLi/dart-gui-7b

Image-to-Text • 8B • Updated Dec 23, 2025 • 18 • 1

Dream-org/Dream-VL-7B

Image-Text-to-Text • 8B • Updated Mar 16 • 115 • 13

Dream-org/Dream-VLA-7B

Robotics • 8B • Updated Jan 5 • 27 • 23

siddharth-magesh/clip-flickr30k

Feature Extraction • Updated Dec 24, 2025

amewebstudio/livia-multimodal-v1

10B • Updated Dec 29, 2025 • 2

prithivMLmods/Dolphin-v2-f32-GGUF

Image-Text-to-Text • 3B • Updated Dec 24, 2025 • 1.99k • 2

internlm/CapRL-Qwen3VL-4B

Image-Text-to-Text • 4B • Updated Apr 16 • 858 • 12

ryanscottbarrett/braille256-v6

Text Generation • Updated Dec 24, 2025 • 5

mlx-community/Qwen3-Omni-30B-A3B-Instruct-4bit

Any-to-Any • 7B • Updated Dec 24, 2025 • 244 • 2

mlx-community/Qwen3-Omni-30B-A3B-Instruct-5bit

Any-to-Any • 8B • Updated Dec 24, 2025 • 33

mlx-community/Qwen3-Omni-30B-A3B-Instruct-6bit

Any-to-Any • 9B • Updated Dec 24, 2025 • 33

mlx-community/Qwen3-Omni-30B-A3B-Instruct-8bit

Any-to-Any • 11B • Updated Dec 24, 2025 • 110 • 1

mlx-community/Qwen3-Omni-30B-A3B-Instruct-bf16

Any-to-Any • 35B • Updated Dec 24, 2025 • 196 • 6

cybermotaz/Qwen3-VL-32B-Instruct-NVFP4

Image-Text-to-Text • 18B • Updated Dec 24, 2025 • 101

cybermotaz/Qwen3-Omni-30B-A3B-Instruct-NVFP4

Text Generation • Updated Dec 25, 2025 • 7

Tasfiya025/HAR_MultiModal_Classifier

Updated Dec 25, 2025 • 5

iamthehimansh/LlamaVision-llama-3.3-1b

Image-Text-to-Text • 2B • Updated Dec 25, 2025 • 13 • 1

iioos/multimodal-caption-model

Updated Dec 26, 2025

Cycl0/Molmo2-VideoPoint-4B-bnb-4bit

Video-Text-to-Text • 5B • Updated Dec 26, 2025 • 7

internlm/CapRL-Qwen3VL-2B-GGUF

Image-Text-to-Text • 2B • Updated Dec 29, 2025 • 285 • 5

internlm/CapRL-Qwen3VL-4B-GGUF

Image-Text-to-Text • 4B • Updated Dec 29, 2025 • 185 • 4

mradermacher/CapRL-Qwen3VL-2B-GGUF

2B • Updated Dec 26, 2025 • 81

mradermacher/CapRL-Qwen3VL-4B-GGUF

4B • Updated Dec 26, 2025 • 35 • 1