Gaperon Collection Our French-English LLM suite (including Base and SFT models. All checkpoints are also included. • 16 items • Updated about 21 hours ago • 17
Gaperon: A Peppered English-French Generative Language Model Suite Paper • 2510.25771 • Published Oct 29, 2025 • 16
view article Article There is no such thing as a tokenizer-free lunch catherinearnett • Sep 25, 2025 • 98
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper • 2503.02812 • Published Mar 4, 2025 • 10
Headless Language Models: Learning without Predicting with Contrastive Weight Tying Paper • 2309.08351 • Published Sep 15, 2023 • 3