A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone • Paper • 2505.12781 • Published May 19, 2025
Low-Rank Clone (LRC) • Collection • Model checkpoints for the paper "A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone". • 10 items • Updated Jul 11, 2025
Awesome SFT datasets • Collection • A curated list of interesting datasets for fine-tuning language models. • 43 items • Updated Apr 12, 2024