How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="automajicly/qwen-1.5b-android",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Qwen 2.5 1.5B Android Quantized versions of Qwen 2.5 1.5B Instruct optimized for Android devices. Models • qwen-1.5b-q4.gguf (1.12 GB): High-end Android phones (8GB+ RAM) • qwen-1.5b-q3.gguf (924 MB): Mid-range Android phones (4GB+ RAM) Both use GGUF format for fast inference with llama.cpp. Usage Download the model file and use with any GGUF-compatible app. EOF

Downloads last month
322
GGUF
Model size
2B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support