Quantization of models designed to fit within the memory constraints of 2x Strix Halo machines. Can also be ran on any generic hardware using vLLM.
Sasha
ayysasha
AI & ML interests
None yet
Recent Activity
new activity 7 days ago
z-lab/MiniMax-M2.7-DFlash:Please kindly approve my request to access this model updated a collection 9 days ago
Dual Strix Halo Quants updated a model 14 days ago
ayysasha/MiniMax-M2.7-AWQ-G32-STRIX-2HOrganizations
None yet