Qwen2.5-0.5B-Instruct โ€” CoreML (ANE+GPU Optimized)

Converted from Qwen/Qwen2.5-0.5B-Instruct for on-device inference on Apple devices.

File Size Description
model.mlpackage 302 MB Monolithic decoder with stateful KV cache (int4)
  • HF-exact match: "The capital of France is Paris." โœ…
  • iOS 18+ required (MLState API)

See CoreML-LLM for full details.

Downloads last month
15
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for mlboydaisuke/qwen2.5-0.5b-coreml

Quantized
(192)
this model