-
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 34 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 111 -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
Paper • 2511.12884 • Published • 19 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 130
MN
ma1664
·
AI & ML interests
None yet
Recent Activity
updated
a collection
7 days ago
Papers
updated
a collection
7 days ago
Papers
updated
a collection
7 days ago
Papers
Organizations
None yet
Models
Spaces
-
RunningFeatured428
FastVLM WebGPU
🍎428Real-time video captioning powered by FastVLM
-
Running on ZeroMCPFeatured1.79k
Qwen Image Edit Camera Control
🎬1.79kFast 4 step inference with Qwen Image Edit 2509
-
Running on ZeroFeatured342
Depth Anything 3
🏢342Create detailed depth maps from images using Depth Anything 3
Datasets
Papers
-
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 34 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 111 -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
Paper • 2511.12884 • Published • 19 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 130
Spaces
-
RunningFeatured428
FastVLM WebGPU
🍎428Real-time video captioning powered by FastVLM
-
Running on ZeroMCPFeatured1.79k
Qwen Image Edit Camera Control
🎬1.79kFast 4 step inference with Qwen Image Edit 2509
-
Running on ZeroFeatured342
Depth Anything 3
🏢342Create detailed depth maps from images using Depth Anything 3
Models
Datasets