meituan-longcat/LongCat-Flash-Chat Text Generation • 562B • Updated Sep 24, 2025 • 44.5k • 529
The Ultra-Scale Playbook 🌌 • Running • 3.76k • The ultimate guide to training LLMs on large GPU clusters
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models Paper • 2309.14717 • Published Sep 26, 2023 • 46