YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Qwen 2.5 1.5B Android Quantized versions of Qwen 2.5 1.5B Instruct optimized for Android devices. Models • qwen-1.5b-q4.gguf (1.12 GB): High-end Android phones (8GB+ RAM) • qwen-1.5b-q3.gguf (924 MB): Mid-range Android phones (4GB+ RAM) Both use GGUF format for fast inference with llama.cpp. Usage Download the model file and use with any GGUF-compatible app. EOF

Downloads last month
322
GGUF
Model size
2B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support