nvidia/llama-nemotron-embed-vl-1b-v2 Sentence Similarity • 2B • Updated 26 days ago • 114k • 66
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Paper • 2509.19296 • Published Sep 23, 2025 • 32
Open VLM Video Leaderboard: VLMEvalKit Eval Results in video understanding benchmark • Running • Agents • Featured • 134
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 15 days ago • 239
InfoBayAI/Audio-to-Sentiment_Intelligence_Model Audio-Text-to-Text • 67M • Updated 7 days ago • 18 • 5