amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-cpu Text Generation • Updated Jan 30, 2025
Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA Collection ONNX Runtime generate() API based models quantized by Quark and optimized for Ryzen AI Strix Point NPU • 8 items • Updated Feb 19 • 8