	---
	license: mit
	base_model: microsoft/Phi-4-mini-reasoning
	tags:
	- phi-4
	- webgpu
	- browser-inference
	- strix-halo
	- amd
	- unified-memory
	- reasoning
	- math
	- chain-of-thought
	pipeline_tag: text-generation
	---

	# Phi-4-mini-reasoning on WebGPU

	First WebGPU package for Microsoft's Phi-4-mini-reasoning model.

	This is the reasoning variant (not the instruct variant), trained on chain-of-thought data distilled from DeepSeek-R1. It has 3.8B parameters, and the Q4_K_M quantization is 2.4 GB. Inference runs entirely in the browser via WebGPU and wllama.
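As a rough sanity check on the size figures above, the effective bits per weight can be estimated from the file size and parameter count. This is a back-of-envelope sketch, assuming "GB" means 10^9 bytes and using the rounded numbers from this card; `bitsPerWeight` is an illustrative helper, not part of this package.

```javascript
// Estimate the average number of bits stored per model weight.
// Assumption: fileSizeBytes covers the whole GGUF file, including metadata,
// so the result slightly overstates the per-weight cost.
function bitsPerWeight(fileSizeBytes, parameterCount) {
  return (fileSizeBytes * 8) / parameterCount;
}

const fileSizeBytes = 2.4e9; // 2.4 GB Q4_K_M file (rounded figure from this card)
const parameterCount = 3.8e9; // 3.8B parameters (rounded figure from this card)

const bpw = bitsPerWeight(fileSizeBytes, parameterCount);
console.log(`${bpw.toFixed(2)} bits per weight`); // roughly 5 bits per weight
```

For comparison, the same parameter count at 16 bits per weight would come to about 7.6 GB, which is why the 4-bit quantization matters for in-browser use.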

	## Quick Start

	1. Download the Q4_K_M GGUF file from bartowski.
	2. Place it in `model_splits/` (a single file; no splitting is needed).
	3. Run `node serve.js` (serves on port 8190).
	4. Open http://localhost:8190 in Chrome.
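The contents of `serve.js` are not shown in this card. As a minimal sketch of what a local server for these steps could look like, the code below serves files from a directory and streams large files such as the GGUF instead of buffering them. Everything here is an assumption for illustration: the helper names (`contentTypeFor`, `resolveRequestPath`, `createStaticServer`) are hypothetical, and the two cross-origin isolation headers are included only because browsers require them for `SharedArrayBuffer`, which multi-threaded WebAssembly builds depend on.

```javascript
const http = require("node:http");
const fs = require("node:fs");
const path = require("node:path");

const PORT = 8190; // port named in the Quick Start steps

const CONTENT_TYPES = {
  ".html": "text/html; charset=utf-8",
  ".js": "text/javascript; charset=utf-8",
  ".wasm": "application/wasm",
  ".gguf": "application/octet-stream",
};

// Pick a Content-Type from the file extension, defaulting to raw bytes.
function contentTypeFor(filePath) {
  const ext = path.extname(filePath).toLowerCase();
  return CONTENT_TYPES[ext] || "application/octet-stream";
}

// Map a request URL to a file under rootDir.
// Returns null for malformed URLs or paths that escape rootDir.
function resolveRequestPath(rootDir, urlPath) {
  const root = path.resolve(rootDir);
  let relative;
  try {
    relative = decodeURIComponent(urlPath.split("?")[0]);
  } catch {
    return null;
  }
  const requested = relative.endsWith("/") ? relative + "index.html" : relative;
  const target = path.join(root, requested);
  return target.startsWith(root + path.sep) ? target : null;
}

// Build (but do not start) an HTTP server that streams files from rootDir.
function createStaticServer(rootDir) {
  return http.createServer((req, res) => {
    const filePath = resolveRequestPath(rootDir, req.url);
    if (!filePath) {
      res.writeHead(403);
      res.end("Forbidden");
      return;
    }
    fs.stat(filePath, (err, stats) => {
      if (err || !stats.isFile()) {
        res.writeHead(404);
        res.end("Not found");
        return;
      }
      res.writeHead(200, {
        "Content-Type": contentTypeFor(filePath),
        "Content-Length": stats.size,
        // Cross-origin isolation, needed for SharedArrayBuffer in the page.
        "Cross-Origin-Opener-Policy": "same-origin",
        "Cross-Origin-Embedder-Policy": "require-corp",
      });
      // Stream the file so a multi-gigabyte model is never held in memory.
      fs.createReadStream(filePath).pipe(res);
    });
  });
}

// To start the sketch server from the project directory:
// createStaticServer(__dirname).listen(PORT);
```

Streaming matters here because the model file is far larger than a typical web asset; reading it fully into memory before responding would be slow and wasteful.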

	## Hardware

	Tested on a GMKtec EVO-X2 (AMD Strix Halo). It should run on any WebGPU-capable device with at least 3 GB of available memory.
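Whether a given browser and device are WebGPU-capable can be checked before loading the model. The sketch below uses the standard `navigator.gpu.requestAdapter()` call; the `checkWebGPU` helper and its return shape are hypothetical and not part of this package. Note that WebGPU does not report available memory, so the adapter limits shown here are only an indirect hint.

```javascript
// Probe for WebGPU support.
// `nav` defaults to the browser's navigator; passing it in makes the
// function easy to exercise outside a browser.
async function checkWebGPU(nav = globalThis.navigator) {
  if (!nav || !("gpu" in nav)) {
    return { supported: false, reason: "navigator.gpu is not available" };
  }
  // requestAdapter() resolves to null when no suitable adapter exists.
  const adapter = await nav.gpu.requestAdapter();
  if (!adapter) {
    return { supported: false, reason: "no suitable GPU adapter was found" };
  }
  return {
    supported: true,
    // Largest single buffer and storage binding the adapter allows, in bytes.
    maxBufferSize: adapter.limits.maxBufferSize,
    maxStorageBufferBindingSize: adapter.limits.maxStorageBufferBindingSize,
  };
}
```

In a page, `checkWebGPU().then((result) => console.log(result))` would log the outcome; a result with `supported: false` suggests trying a different browser or updating GPU drivers before downloading the model.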

	## Credits

	Built by Joshua (LJTSG) and Claude.
	Co-Authored-By: Claude <noreply@anthropic.com>