Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset Article • sdiazlor • Feb 10, 2025 • 59
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation Paper • 2410.03960 • Published Oct 4, 2024 • 2