-
-
-
-
-
-
Inference Providers
Active filters:
Sa2VA
Image-Text-to-Text
•
4B
•
Updated
•
156k
•
94
ByteDance/Sa2VA-Qwen3-VL-2B
Image-Text-to-Text
•
3B
•
Updated
•
39
•
15
Image-Text-to-Text
•
4B
•
Updated
•
3
Dense-World/Sa2VA_InternVL2.5_4b
Image-Text-to-Text
•
4B
•
Updated
•
3
•
1
Dense-World/Sa2VA_InternVL2.5_8b
Image-Text-to-Text
•
8B
•
Updated
•
1
Dense-World/Sa2VA_InternVL2.5_26b
Image-Text-to-Text
•
26B
•
Updated
•
5
Image-Text-to-Text
•
8B
•
Updated
•
1.27k
•
65
Image-Text-to-Text
•
1B
•
Updated
•
1.13k
•
29
Image-Text-to-Text
•
26B
•
Updated
•
90
•
31
Image Segmentation
•
4B
•
Updated
•
2
Image Segmentation
•
1B
•
Updated
•
232
Image Segmentation
•
8B
•
Updated
•
3
Image Segmentation
•
26B
•
Updated
•
3
ByteDance/Sa2VA-InternVL3-2B
Image-Text-to-Text
•
2B
•
Updated
•
114
•
1
ByteDance/Sa2VA-InternVL3-8B
Image-Text-to-Text
•
8B
•
Updated
•
91
•
4
ByteDance/Sa2VA-InternVL3-14B
Image-Text-to-Text
•
15B
•
Updated
•
51
•
9
ByteDance/Sa2VA-Qwen2_5-VL-3B
Image-Text-to-Text
•
4B
•
Updated
•
154
•
2
ByteDance/Sa2VA-Qwen2_5-VL-7B
Image-Text-to-Text
•
9B
•
Updated
•
105
•
4
ByteDance/Sa2VA-Qwen3-VL-4B
Image-Text-to-Text
•
5B
•
Updated
•
1.2k
•
14