Qwen2.5-0.5B-Instruct — CoreML (ANE+GPU Optimized)

Converted from Qwen/Qwen2.5-0.5B-Instruct for on-device inference on Apple devices.

| File | Size | Description |
| --- | --- | --- |
| `model.mlpackage` | 302 MB | Monolithic decoder with stateful KV cache (int4) |

See CoreML-LLM for full details.
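
For a quick look at the package from Python, the following is a minimal sketch rather than the project's own tooling. It assumes `huggingface_hub` and `coremltools` are installed, and that the repository ID is `mlboydaisuke/qwen2.5-0.5b-coreml` as shown on the model page. The helper names (`package_path`, `download_package`, `describe_model`) are hypothetical. The model's input and output names are not documented here, so the sketch reads them from the model description instead of guessing them.

```python
from pathlib import Path

# Assumption: repository ID as shown on the model page.
REPO_ID = "mlboydaisuke/qwen2.5-0.5b-coreml"
# File name taken from the file table in this README.
PACKAGE_NAME = "model.mlpackage"


def package_path(local_dir):
    """Return the expected location of the .mlpackage inside a download directory."""
    return Path(local_dir) / PACKAGE_NAME


def download_package():
    """Download only the .mlpackage directory from the Hub and return its local path."""
    # Imported lazily so the module can be imported without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    local_dir = snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[f"{PACKAGE_NAME}/*"],  # an .mlpackage is a directory of files
    )
    return package_path(local_dir)


def describe_model(path):
    """Load the package with Core ML Tools and return its input and output names."""
    # Imported lazily; Core ML models can only be run on Apple platforms.
    import coremltools as ct

    model = ct.models.MLModel(str(path), compute_units=ct.ComputeUnit.ALL)
    description = model.get_spec().description
    input_names = [feature.name for feature in description.input]
    output_names = [feature.name for feature in description.output]
    return input_names, output_names


# Example usage (requires network access and macOS):
#   path = download_package()
#   inputs, outputs = describe_model(path)
#   print(inputs, outputs)
```

`ct.ComputeUnit.ALL` leaves the choice between CPU, GPU, and Neural Engine to Core ML; other `ComputeUnit` values can be passed to restrict it. Running the stateful KV cache from Python is not shown, since the expected inputs depend on how the model was exported.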
