How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull teragron/quantest:Q2_K
Run and chat with the model
lemonade run user.quantest-Q2_K
List all available models
lemonade list
Quick Links

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

./llama-server -m SmolVLM-256M-Instruct-Q2_K.gguf --mmproj mmproj-SmolVLM-256M-Instruct-TQ2.gguf --host 0.0.0.0 -c 5

Downloads last month
21
GGUF
Model size
0.2B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

2-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support