vid analysis
updated
prithivMLmods/Qwen3-VL-8B-Abliterated-Caption-it
Image-Text-to-Text
• 9B • Updated • 125
• 32
mradermacher/Qwen3-VL-8B-NSFW-Caption-V4.5-GGUF
8B • Updated • 8.52k
• 66
prithivMLmods/Qwen3-VisionCaption-2B
Image-Text-to-Text
• 2B • Updated • 69
• 5
msrcam/Qwen3-VL-2B-Instruct-heretic
2B • Updated • 24
ghost-actual/Qwen3.5-4B-Claude-Opus-4.6-Distilled-heretic
Text Generation
• 5B • Updated • 83
• 3
Image-Text-to-Text
• 3B • Updated • 12
• 5
Image-Text-to-Text
• 5B • Updated • 9.47M
• • 691
bobber/routangseng-qwen35-0.8b-abliterated-onnx
Image-Text-to-Text
• Updated • 28
bobber/routangseng-0.8b-hottake-onnx
Image-Text-to-Text
• Updated • 3
Caplin43/multimodal-vision-language-mini
Image-to-Text
• Updated • 4
Ytgetahun/visual-narrator-llm
Image-to-Text
• Updated
Ytgetahun/visual-narrator-vlm
0.2B • Updated • 3
allenai/MolmoPoint-Vid-4B
Video-Text-to-Text
• 5B • Updated • 404
• 12
TencentARC/ARC-Qwen-Video-7B-Narrator
Video-Text-to-Text
• 9B • Updated • 37
• 11
Video-Text-to-Text
• Updated • 25
• 32
Image-Text-to-Text
• 5B • Updated • 31.2k
• 51
DAMO-NLP-SG/VideoLLaMA3-2B
Video-Text-to-Text
• 2B • Updated • 2.14k
• 21
u94fmn391j/SAVANT-scene-description-lora
Image-to-Text
• Updated • 4
VINAY-UMRETHE/SigMamba-V1-Large
Video Classification
• 0.9B • Updated • 285
• 5
qoranet/QORA-Vision-Video
Video Classification
• Updated • 4
sumit7488/TimesFormer_Baseline
Video Classification
• 0.1B • Updated • 2
StreamFormer/streamformer-timesformer
Video Classification
• 0.1B • Updated • 24
• 4
facebook/vjepa2-vitg-fpc64-256
Video Classification
• 1B • Updated • 284k
• 56
Image-Text-to-Text
• 3B • Updated • 12.2k
• 201
Video-Text-to-Text
• Updated • 2
BidirLM/BidirLM-Omni-2.5B-Embedding
Sentence Similarity
• 2B • Updated • 376
• 44
Image-Text-to-Text
• Updated • 15
• 14
Video-Text-to-Text
• 0.9B • Updated • 3
• 1
Video-Text-to-Text
• 2B • Updated • 15.3k
• 544
LongVie 2: Multimodal Controllable Ultra-Long Video World Model
Paper
• 2512.13604
• Published • 76
Image-Text-to-Text
• 4B • Updated • 495k
• 2.38k
prithivMLmods/Gemma4-BLIP3o-Captioner-5B
Image-Text-to-Text
• 5B • Updated • 1.78k
• 2
lewiswatson/Frame2KG-LFM-2.5-450m-JSON
Image-Text-to-Text
• 0.4B • Updated • 122