MJPansa/MiniMax-M2.7-REAP-172B-A10B-AutoRound-W4A16 Text Generation • 24B • Updated 22 days ago • 4.32k • 9
The Ultra-Scale Playbook • Running • 3.83k • The ultimate guide to training LLMs on large GPU clusters
openGPT-X/Teuken-7B-instruct-commercial-v0.4 Text Generation • 7B • Updated Dec 11, 2024 • 1.39k • 74
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 163