Nano Stuff Collection A collection of all the Nano models I made. Nano Stuff contains from SR to music generation and so on. • 3 items • Updated about 17 hours ago
Running on CPU Upgrade 19 Transformer Autocomplete 🏆 19 Generate text continuations with modern transformer models
Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces Paper • 2605.02801 • Published 3 days ago • 4
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness Paper • 2605.02396 • Published 3 days ago • 14
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL Paper • 2604.28123 • Published 6 days ago • 39