Running 3.83k The Ultra-Scale Playbook š 3.83k The ultimate guide to training LLM on large GPU Clusters
mistral-community/Mixtral-8x22B-v0.1 Text Generation ⢠141B ⢠Updated Jul 1, 2024 ⢠245 ⢠671