Running 3.77k The Ultra-Scale Playbook π 3.77k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-Coder-V2-Instruct Text Generation β’ 236B β’ Updated Aug 21, 2024 β’ 6.64k β’ 682
Qwen/Qwen2.5-Coder-32B-Instruct Text Generation β’ 33B β’ Updated Jan 12, 2025 β’ 1.11M β’ β’ 2k
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation β’ Updated Feb 24, 2025 β’ 153k β’ β’ 764
Running on Zero MCP Featured 2.01k Stable Video Diffusion 1.1 πΊ 2.01k Create a short video from a single image
jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition β’ 0.3B β’ Updated Mar 25, 2023 β’ 42.4k β’ 478