Measuring Maximum Activations in Open Large Language Models Paper โข 2605.15572 โข Published 5 days ago โข 16
EndPrompt: Efficient Long-Context Extension via Terminal Anchoring Paper โข 2605.14589 โข Published 6 days ago โข 12
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook ๐ 3.18k The secrets to building world-class LLMs