MLC version of microsoft/Phi-3-mini-4k-instruct, using q0f16 quantization.
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for Felladrin/mlc-q0f16-Phi-3-mini-4k-instruct
Base model
microsoft/Phi-3-mini-4k-instruct