Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters
Running on Zero Featured 2.01k Chat With Janus-Pro-7B 🌍 2.01k A unified multimodal understanding and generation model.
cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese Text Generation • 33B • Updated Jan 27, 2025 • 339 • • 255
Runtime error Featured 5.07k MusicGen 🎵 5.07k Generate music from text descriptions and optional melodies