How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull quantflex/MicroThinker-1B-Preview-GGUF:
Run and chat with the model
lemonade run user.MicroThinker-1B-Preview-GGUF-
List all available models
lemonade list
Quick Links

QuantFlex Banner

GGUF Quants for: MicroThinker-1B-Preview

Model by: huihui-ai (thank you!)

Quants by: quantflex

Run with llama.cpp:

./llama-cli -m MicroThinker-1B-Preview-Q5_K_M.gguf -cnv -p "You are a helpful assistant. You should think step-by-step." --chat-template llama3

Downloads last month
41
GGUF
Model size
1B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

5-bit

6-bit

8-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for quantflex/MicroThinker-1B-Preview-GGUF