cosmo3769/train_synthetic_dataset_21.4k_images_nanovlm_full_data Image-Text-to-Text • 0.2B • Updated Jun 28, 2025 • 9 • 1
lambertxiao/Vision-Language-Vision-Captioner-Qwen2.5-3B Image-to-Text • 5B • Updated Sep 2, 2025 • 25 • 2