---
tags:
- gguf
- llama.cpp
- unsloth

---

# flash : GGUF

This model was fine-tuned and converted to GGUF format using [Unsloth](https://github.com/unslothai/unsloth).

**Example usage**:
- For text-only LLMs:    `llama-cli -hf assemsabry/flash --jinja`
- For multimodal models: `llama-mtmd-cli -hf assemsabry/flash --jinja`
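
If you want to launch the model from a script, the two commands above can be wrapped in a small helper. This is a minimal sketch, not part of the model's tooling: `build_command` and `run` are hypothetical names, and it assumes the llama.cpp binaries `llama-cli` and `llama-mtmd-cli` are installed and on your `PATH`.

```python
import shutil
import subprocess

# Repository ID taken from the usage lines above.
REPO_ID = "assemsabry/flash"


def build_command(multimodal: bool = False, repo_id: str = REPO_ID) -> list[str]:
    """Return the argv list for the llama.cpp CLI that matches the model type."""
    # Text-only models use llama-cli; multimodal models use llama-mtmd-cli.
    binary = "llama-mtmd-cli" if multimodal else "llama-cli"
    return [binary, "-hf", repo_id, "--jinja"]


def run(multimodal: bool = False) -> int:
    """Start the CLI and return its exit code (hypothetical helper)."""
    cmd = build_command(multimodal)
    # Fail early with a clear message if llama.cpp is not installed.
    if shutil.which(cmd[0]) is None:
        raise FileNotFoundError(f"{cmd[0]} not found on PATH; install llama.cpp first")
    return subprocess.run(cmd).returncode
```

Calling `run()` starts an interactive text-only session, and `run(multimodal=True)` uses the multimodal binary instead.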

## Available model files
- `Llama-3.1-Minitron-4B-Width-Base.F16.gguf`
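
After downloading the file manually, a quick header check can confirm that it is an intact GGUF file. The sketch below assumes the little-endian GGUF layout used by version 2 and later (4-byte magic `GGUF`, a 32-bit version, then 64-bit tensor and metadata counts); `read_gguf_header` is a hypothetical helper, not a llama.cpp or Unsloth function.

```python
import struct


def read_gguf_header(path: str) -> dict:
    """Read the fixed-size GGUF header (assumes little-endian, GGUF v2 or later)."""
    with open(path, "rb") as f:
        header = f.read(24)
    # A valid file starts with the 4-byte magic "GGUF".
    if len(header) < 24 or header[:4] != b"GGUF":
        raise ValueError(f"{path} does not look like a GGUF file")
    # uint32 version, uint64 tensor count, uint64 metadata key/value count.
    version, tensor_count, kv_count = struct.unpack("<IQQ", header[4:24])
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }
```

For example, `read_gguf_header("Llama-3.1-Minitron-4B-Width-Base.F16.gguf")` returns the format version and counts, or raises `ValueError` if the download is truncated or is not a GGUF file.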

## Note
The model's BOS token behavior was adjusted for GGUF compatibility.

This model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth).
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)