Vision-Language moonshotai/Kimi-VL-A3B-Thinking Image-Text-to-Text • 16B • Updated Aug 18 • 13.9k • 442 OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 10.6k • 48 LifuWang/DistillT5 0.1B • Updated Apr 11 • 142 • 28
OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 10.6k • 48
Vision-Language moonshotai/Kimi-VL-A3B-Thinking Image-Text-to-Text • 16B • Updated Aug 18 • 13.9k • 442 OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 10.6k • 48 LifuWang/DistillT5 0.1B • Updated Apr 11 • 142 • 28
OpenGVLab/InternViT-300M-448px-V2_5 Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 10.6k • 48