Running on CPU Upgrade Featured 3.17k The Smol Training Playbook ๐ 3.17k The secrets to building world-class LLMs
Landmark Attention: Random-Access Infinite Context Length for Transformers Paper โข 2305.16300 โข Published May 25, 2023 โข 1