view article Article Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP +3 ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • 19 days ago • 50
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq • Dec 18, 2025 • 125
ARC-Encoders Collection Pretrained ARC-Encoders and a fine-tuning dataset: context compression for unmodified LLMs. • 6 items • Updated Mar 26 • 5
view article Article Luth: Efficient French Specialization for Small Language Models MaxLSB • Aug 11, 2025 • 21
view article Article Should We Still Pretrain Encoders with Masked Language Modeling? Nicolas-BZRD • Jul 2, 2025 • 22
view article Article 🇪🇺 EU AI Act: Comments on the Third Code of Practice Draft 🇪🇺 frimelle • Mar 13, 2025 • 9
Reducing the Footprint of Multi-Vector Retrieval with Minimal Performance Impact via Token Pooling Paper • 2409.14683 • Published Sep 23, 2024 • 11