Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

video-captioning

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

31

Base only

Active filters: video-captioning

NemoStation/Marlin-2B

Video-Text-to-Text • 2B • Updated 28 days ago • 15.3k • 544

Neleac/SpaceTimeGPT

Video-Text-to-Text • Updated Dec 2, 2025 • 25 • 32

kpyu/video-blip-opt-2.7b-ego4d

Image-to-Text • Updated May 17, 2023 • 195 • 20

kpyu/video-blip-flan-t5-xl-ego4d

Image-to-Text • Updated May 17, 2023 • 16 • 3

kpyu/eilev-blip2-opt-2.7b

Image-to-Text • 4B • Updated Oct 22, 2024 • 8 • 4

kpyu/eilev-blip2-flan-t5-xl

Image-to-Text • 4B • Updated Oct 22, 2024 • 8 • 1

jylins/vtsum_blip

Updated Apr 22, 2024 • 3

openinterx/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Jul 19, 2025 • 121 • 4

TencentARC/ARC-Hunyuan-Video-7B

Video-Text-to-Text • 9B • Updated Sep 19, 2025 • 1.48k • 38

TencentARC/ARC-Qwen-Video-7B

Video-Text-to-Text • 9B • Updated Sep 21, 2025 • 23 • 9

TencentARC/ARC-Qwen-Video-7B-Narrator

Video-Text-to-Text • 9B • Updated Sep 21, 2025 • 37 • 11

Memories-ai/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Oct 5, 2025 • 5 • 2

chancharikm/CHAI_SFT_model_8b

Video-Text-to-Text • 770k • Updated May 13 • 809 • 1

AudioVisual-Caption/ASID-Captioner-3B

Image-Text-to-Text • 5B • Updated Mar 11 • 10 • 37

AudioVisual-Caption/ASID-Captioner-7B

Image-Text-to-Text • 9B • Updated Mar 11 • 21 • 6

mradermacher/ASID-Captioner-7B-GGUF

8B • Updated Feb 27 • 137 • 1

mradermacher/ASID-Captioner-3B-GGUF

3B • Updated Feb 27 • 137 • 1

mradermacher/ASID-Captioner-3B-i1-GGUF

3B • Updated Feb 28 • 101 • 1

mradermacher/ASID-Captioner-7B-i1-GGUF

8B • Updated Feb 28 • 122 • 1

oonepieceeyewear/UGC-VideoCaptioner

Video-Text-to-Text • 6B • Updated Apr 15 • 2

mlcglab/synwts

mradermacher/CHAI_SFT_model_8b-GGUF

8B • Updated May 15 • 95 • 1

cudabenchmarktest/video-scan

Video-Text-to-Text • 2B • Updated May 20 • 6

MananSuri27/video2lora-smolvlm2-500m-video-best-ce

Updated May 20 • 1

lunahr/Marlin-2B-ungated

Video-Text-to-Text • 2B • Updated May 22 • 7.51k • 6

momentslab/peek

Other • Updated 25 days ago • 49 • 6

junwatu/Marlin-2B-MLX-8bit

Video-Text-to-Text • 0.9B • Updated about 1 month ago • 138 • 6

NJU-LINK/OmniCaptioner-IF-7B

Image-Text-to-Text • 11B • Updated 18 days ago • 39

NJU-LINK/OmniCaptioner-IF-3B

Image-Text-to-Text • 6B • Updated 18 days ago • 49

tintwotin/Marlin-2B-SDNQ-int8

Video-Text-to-Text • 2B • Updated 18 days ago • 137