view article Article Ulysses Sequence Parallelism: Training with Million-Token Contexts 4 days ago • 14
Running on CPU Upgrade Featured 3.04k The Smol Training Playbook 📚 3.04k The secrets to building world-class LLMs
T5 release Collection The original T5 transformer release was done in two steps, the original T5 checkpoints and the improved T5v1 • 9 items • Updated about 11 hours ago • 18
UniVideo: Unified Understanding, Generation, and Editing for Videos Paper • 2510.08377 • Published Oct 9, 2025 • 81
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published Jan 5 • 30
Unsloth Diffusion GGUFs Collection Find GGUFs and other variants of diffusion based models like Qwen-Image and FLUX. • 20 items • Updated about 21 hours ago • 57