
flash : GGUF

This model was finetuned and converted to GGUF format using Unsloth.

Example usage:

  • For text-only LLMs: llama-cli -hf assemsabry/flash --jinja
  • For multimodal models: llama-mtmd-cli -hf assemsabry/flash --jinja
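
The two commands above differ only in the binary name, so a small wrapper can choose between them. The following is a minimal sketch, assuming llama.cpp is installed and its binaries are on PATH. The function names `pick_llama_binary` and `run_flash` and the mode labels `text` and `multimodal` are hypothetical and used only for illustration; the binaries and flags are the ones listed above.

```shell
#!/bin/sh
# Sketch only: the function names and the "text"/"multimodal" labels are
# hypothetical; the binaries and flags are the ones listed above.

MODEL_REPO="assemsabry/flash"

# Map a mode label to the matching llama.cpp binary.
pick_llama_binary() {
  case "$1" in
    text)       echo "llama-cli" ;;
    multimodal) echo "llama-mtmd-cli" ;;
    *)          echo "unknown mode: $1" >&2; return 1 ;;
  esac
}

# Run the model, failing with a readable message if the binary is missing.
run_flash() {
  bin="$(pick_llama_binary "${1:-text}")" || return 1
  if ! command -v "$bin" >/dev/null 2>&1; then
    echo "$bin not found on PATH; install llama.cpp first" >&2
    return 1
  fi
  "$bin" -hf "$MODEL_REPO" --jinja
}
```

After sourcing this snippet, `run_flash text` should be equivalent to the first command in the list, and `run_flash multimodal` to the second.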

Available Model files:

  • Llama-3.1-Minitron-4B-Width-Base.F16.gguf

Note

The model's BOS token behavior was adjusted for GGUF compatibility. This model was trained 2x faster with Unsloth.

Safetensors:

  • Model size: 5B params
  • Tensor type: BF16