How to use from Unsloth Studio
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for assemsabry/flash to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for assemsabry/flash to start chatting
Using Unsloth Studio on Hugging Face Spaces
# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for assemsabry/flash to start chatting

flash : GGUF

This model was finetuned and converted to GGUF format using Unsloth.

Example usage:

  • For text-only LLMs: llama-cli -hf assemsabry/flash --jinja
  • For multimodal models: llama-mtmd-cli -hf assemsabry/flash --jinja
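
The two commands above differ only in the binary name, so a small wrapper can pick the right one. The following is a minimal sketch, not part of Unsloth or llama.cpp: the function name `flash_cmd` and its `text`/`multimodal` argument are hypothetical, and it only prints the command rather than running it.

```shell
#!/bin/sh
# Hypothetical helper (illustrative only): print the llama.cpp command
# for this model. The repo name and flags are copied from this card.
MODEL_REPO="assemsabry/flash"

flash_cmd() {
  case "$1" in
    text)       echo "llama-cli -hf $MODEL_REPO --jinja" ;;
    multimodal) echo "llama-mtmd-cli -hf $MODEL_REPO --jinja" ;;
    *)          echo "usage: flash_cmd text|multimodal" >&2; return 1 ;;
  esac
}

# Show the command for a text-only session without executing it.
flash_cmd text
```

To start a session instead of printing the command, run `eval "$(flash_cmd text)"`, assuming the llama.cpp binaries are on your PATH.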

Available Model files:

  • Llama-3.1-Minitron-4B-Width-Base.F16.gguf
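
If you download the file manually, a quick sanity check is to confirm it begins with the 4-byte `GGUF` magic that every GGUF file carries. This is a rough sketch under assumptions: the helper name `is_gguf` is hypothetical, the file is assumed to sit in the current directory, and `head -c` must be available on your system. It does not validate the rest of the file.

```shell
# Hypothetical helper: succeed only if the file exists and starts
# with the ASCII bytes "GGUF".
is_gguf() {
  [ -f "$1" ] && [ "$(head -c 4 "$1" 2>/dev/null)" = "GGUF" ]
}

# File name taken from the list above; adjust the path to where you saved it.
MODEL_FILE="Llama-3.1-Minitron-4B-Width-Base.F16.gguf"
if is_gguf "$MODEL_FILE"; then
  echo "$MODEL_FILE looks like a GGUF file"
else
  echo "$MODEL_FILE is missing or does not start with the GGUF magic"
fi
```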

Note

The model's BOS token behavior was adjusted for GGUF compatibility. This model was trained 2x faster with Unsloth.

Safetensors
Model size: 5B params
Tensor type: BF16