--- license: mit base_model: microsoft/Phi-4-mini-reasoning tags: - phi-4 - webgpu - browser-inference - strix-halo - amd - unified-memory - reasoning - math - chain-of-thought pipeline_tag: text-generation --- # Phi-4-mini-reasoning on WebGPU First WebGPU package for Microsoft's Phi-4-mini-reasoning model. The reasoning variant (not instruct) trained on DeepSeek-R1 chain-of-thought distillation. 3.8B params, 2.4 GB Q4_K_M. Runs entirely in browser via WebGPU + wllama. ## Quick Start 1. Download Q4_K_M GGUF from bartowski 2. Place in model_splits/ (single file, no splitting needed) 3. node serve.js (port 8190) 4. Open http://localhost:8190 in Chrome ## Hardware Tested on GMKTEC EVO-X2 (AMD Strix Halo). Works on any WebGPU-capable device with 3+ GB available memory. ## Credits Built by Joshua (LJTSG) and Claude. Co-Authored-By: Claude