Qwen/Qwen3-VL-30B-A3B-Instruct Image-Text-to-Text ⢠31B ⢠Updated Nov 26, 2025 ⢠591k ⢠⢠580
Cosmos-Predict2.5 Collection ā ļø This collection is archived. š https://huggingface.co/collections/nvidia/cosmos3 ⢠2 items ⢠Updated 13 days ago ⢠23
view article Article State of open video generation models in Diffusers +1 sayakpaul, a-r-r-o-w, dn6 ⢠Jan 27, 2025 ⢠71
view article Article Small Language Models (SLM): A Comprehensive Overview jjokah ⢠Feb 22, 2025 ⢠163
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper ⢠2604.12374 ⢠Published Apr 14 ⢠37
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen ⢠Apr 16 ⢠72
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift ⢠Apr 2 ⢠909
Sleeping Agents 1 Context-CrackNet Pavement Crack Analyzer š¬ 1 Segment & measure pavement cracks in pixel space
Sleeping Agents 1 Context-CrackNet Pavement Crack Analyzer š¬ 1 Segment & measure pavement cracks in pixel space