vid analysis - a cinnavun Collection

cinnavun 's Collections

vid analysis

updated 11 days ago

prithivMLmods/Qwen3-VL-8B-Abliterated-Caption-it

Image-Text-to-Text • 9B • Updated 26 days ago • 125 • 32
mradermacher/Qwen3-VL-8B-NSFW-Caption-V4.5-GGUF

8B • Updated Nov 8, 2025 • 8.52k • 66
prithivMLmods/Qwen3-VisionCaption-2B

Image-Text-to-Text • 2B • Updated Nov 29, 2025 • 69 • 5
msrcam/Qwen3-VL-2B-Instruct-heretic

2B • Updated 19 days ago • 24
ghost-actual/Qwen3.5-4B-Claude-Opus-4.6-Distilled-heretic

Text Generation • 5B • Updated Mar 7 • 83 • 3
hustvl/mmMamba-linear

Image-Text-to-Text • 3B • Updated Feb 26, 2025 • 12 • 5
Qwen/Qwen3.5-4B

Image-Text-to-Text • 5B • Updated Mar 2 • 9.47M • • 691
bobber/routangseng-qwen35-0.8b-abliterated-onnx

Image-Text-to-Text • Updated Mar 11 • 28
bobber/routangseng-0.8b-hottake-onnx

Image-Text-to-Text • Updated Mar 18 • 3
Caplin43/multimodal-vision-language-mini

Image-to-Text • Updated Feb 27 • 4
Ytgetahun/visual-narrator-llm

Image-to-Text • Updated May 11
Ytgetahun/visual-narrator-vlm

0.2B • Updated Nov 5, 2025 • 3
allenai/MolmoPoint-Vid-4B

Video-Text-to-Text • 5B • Updated Mar 30 • 404 • 12
TencentARC/ARC-Qwen-Video-7B-Narrator

Video-Text-to-Text • 9B • Updated Sep 21, 2025 • 37 • 11
Neleac/SpaceTimeGPT

Video-Text-to-Text • Updated Dec 2, 2025 • 25 • 32
allenai/Molmo2-4B

Image-Text-to-Text • 5B • Updated Jan 23 • 31.2k • 51
DAMO-NLP-SG/VideoLLaMA3-2B

Video-Text-to-Text • 2B • Updated Sep 3, 2025 • 2.14k • 21
u94fmn391j/SAVANT-scene-description-lora

Image-to-Text • Updated Feb 22 • 4
VINAY-UMRETHE/SigMamba-V1-Large

Video Classification • 0.9B • Updated 1 day ago • 285 • 5
qoranet/QORA-Vision-Video

Video Classification • Updated Mar 1 • 4
sumit7488/TimesFormer_Baseline

Video Classification • 0.1B • Updated Mar 19 • 2
StreamFormer/streamformer-timesformer

Video Classification • 0.1B • Updated Aug 10, 2025 • 24 • 4
facebook/vjepa2-vitg-fpc64-256

Video Classification • 1B • Updated Aug 11, 2025 • 284k • 56
ATH-MaaS/Ovis2.5-2B

Image-Text-to-Text • 3B • Updated Feb 13 • 12.2k • 201
byminji/TC-CLIP

Video-Text-to-Text • Updated Mar 3 • 2
BidirLM/BidirLM-Omni-2.5B-Embedding

Sentence Similarity • 2B • Updated May 12 • 376 • 44
TencentARC/TokLIP

Image-Text-to-Text • Updated Aug 21, 2025 • 15 • 14
UserJoseph/DisTime-1B

Video-Text-to-Text • 0.9B • Updated Sep 17, 2025 • 3 • 1
NemoStation/Marlin-2B

Video-Text-to-Text • 2B • Updated 28 days ago • 15.3k • 544
LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published Dec 15, 2025 • 76
nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 15 days ago • 495k • 2.38k
prithivMLmods/Gemma4-BLIP3o-Captioner-5B

Image-Text-to-Text • 5B • Updated 14 days ago • 1.78k • 2
lewiswatson/Frame2KG-LFM-2.5-450m-JSON

Image-Text-to-Text • 0.4B • Updated 18 days ago • 122