Tiny Series Collection Tiny datasets that empower the foundation of Small Language Model! • 14 items • Updated May 13 • 45
DFlash Collection Block Diffusion for Flash Speculative Decoding • 23 items • Updated about 11 hours ago • 139
Interactivity Alignment Collection Full-duplex speech models post-trained with reinforcement learning for improved conversational interactivity. • 4 items • Updated 19 days ago • 6
Stable Audio 3 Extra Collection Contains all checkpoints that are not the standard post-trained checkpoints found in https://huggingface.co/collections/stabilityai/stable-audio-3 • 7 items • Updated May 20 • 11
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 17 days ago • 168
SpectroStream: A Versatile Neural Codec for General Audio Paper • 2508.05207 • Published Aug 7, 2025 • 3
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 164