Dual Strix Halo Quants Collection Quantization of models designed to fit within the memory constraints of 2x Strix Halo machines. Can also be ran on any generic hardware using vLLM. • 1 item • Updated 7 days ago
ayysasha/MiniMax-M2.7-AWQ-G32-STRIX-2H Text Generation • 51B • Updated 15 days ago • 1.64k • 4
ayysasha/MiniMax-M2.7-AWQ-G32-STRIX-2H Text Generation • 51B • Updated 15 days ago • 1.64k • 4