First run downloads ~2.1 GB (q4 quantization, the smallest ONNX variant). Desktop Chrome/Edge recommended.
This will download model files (~2.1 GB, quantized q4) to your browser cache and run inference via WebGPU. Proceed?
WebGPU requires a secure context (HTTPS or localhost). Chrome and Edge support it by default; Safari and Firefox may need experimental flags enabled.