TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices • Paper • 2410.00531 • Published Oct 1, 2024 • 33
Qwen2.5-Coder • Collection • Code-specific model series based on Qwen2.5 • 38 items • Updated 18 days ago • 357