GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization • Paper • 2601.05242 • Published 29 days ago • 221
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation • Paper • 2602.01756 • Published 5 days ago • 22
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation • Paper • 2602.03796 • Published 3 days ago • 49
PaperBanana: Automating Academic Illustration for AI Scientists • Paper • 2601.23265 • Published 7 days ago • 129
nvidia/canary-qwen-2.5b • Automatic Speech Recognition • 3B • Updated Dec 15, 2025 • 130k • 367
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation • Paper • 2601.22153 • Published 8 days ago • 68
Running • The Eiffel Tower Llama • 105 • Explore the Eiffel Tower Llama experiment with open-source models
Running on Zero • MCP • Featured • Z Image Turbo • 1.69k • Generate realistic images from text descriptions
Running • Featured • Supertonic TTS WebGPU ⚡ • 103 • Blazingly fast text-to-speech 100% locally in your browser
Article • Transformers v5: Simple model definitions powering the AI ecosystem • Dec 1, 2025 • 291
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation • Paper • 2511.14993 • Published Nov 19, 2025 • 231
Article • Introducing smolagents: simple agents that write actions in code. • Dec 31, 2024 • 1.17k
Reply • Just to make sure I understand: you upload the Parquet dataset here on the Hub (I do need to store it somewhere, and Parquet is optimized on the Hub), then use the streaming feature over a constant network connection, right?