meituan-longcat/LongCat-Flash-Chat Text Generation • 562B • Updated Sep 24, 2025 • 85.7k • 534
Running 3.86k The Ultra-Scale Playbook 🌌 3.86k The ultimate guide to training LLM on large GPU Clusters
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models Paper • 2309.14717 • Published Sep 26, 2023 • 46