inference-optimization/Qwen3-30B-A3B-6.5-bits-mode-heuristic-per-tensor 25B • Updated 15 days ago • 49
inference-optimization/Qwen3-30B-A3B-5.5-bits-mode-heuristic-per-tensor 21B • Updated 15 days ago • 42
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-noise-per-tensor 7B • Updated 15 days ago • 41
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-hybrid-per-tensor 7B • Updated 15 days ago • 38
inference-optimization/Llama-3.1-8B-Instruct-7-bits-mode-heuristic-per-tensor 7B • Updated 15 days ago • 44
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-noise-per-tensor 7B • Updated 15 days ago • 37
inference-optimization/Llama-3.1-8B-Instruct-6.5-bits-mode-hybrid-per-tensor 7B • Updated 15 days ago • 52