SiwaSathya commited on
Commit
1cad7e8
·
verified ·
1 Parent(s): edd5951

Add README

Browse files
Files changed (1) hide show
  1. README.md +27 -12
README.md CHANGED
@@ -1,21 +1,36 @@
1
  ---
2
- base_model: unsloth/gemma-3-4b-it-unsloth-bnb-4bit
3
  tags:
4
- - text-generation-inference
5
- - transformers
6
  - unsloth
7
- - gemma3
8
- license: apache-2.0
9
- language:
10
- - en
11
  ---
12
 
13
- # Uploaded finetuned model
14
 
15
- - **Developed by:** SiwaSathya
16
- - **License:** apache-2.0
17
- - **Finetuned from model :** unsloth/gemma-3-4b-it-unsloth-bnb-4bit
18
 
19
- This gemma3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 
 
20
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
21
  [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
1
  ---
 
2
  tags:
3
+ - gguf
4
+ - llama.cpp
5
  - unsloth
6
+ - vision-language-model
 
 
 
7
  ---
8
 
9
+ # model : GGUF
10
 
11
+ This model was finetuned and converted to GGUF format using [Unsloth](https://github.com/unslothai/unsloth).
 
 
12
 
13
+ **Example usage**:
14
+ - For text only LLMs: `./llama.cpp/llama-cli -hf SiwaSathya/model --jinja`
15
+ - For multimodal models: `./llama.cpp/llama-mtmd-cli -hf SiwaSathya/model --jinja`
16
 
17
+ ## Available Model files:
18
+ - `gemma-3-4b-it.Q5_K_M.gguf`
19
+ - `gemma-3-4b-it.Q8_0.gguf`
20
+ - `gemma-3-4b-it.Q4_K_M.gguf`
21
+ - `gemma-3-4b-it.F16-mmproj.gguf`
22
+
23
+ ## ⚠️ Ollama Note for Vision Models
24
+ **Important:** Ollama currently does not support separate mmproj files for vision models.
25
+
26
+ To create an Ollama model from this vision model:
27
+ 1. Place the `Modelfile` in the same directory as the finetuned bf16 merged model
28
+ 3. Run: `ollama create model_name -f ./Modelfile`
29
+ (Replace `model_name` with your desired name)
30
+
31
+ This will create a unified bf16 model that Ollama can use.
32
+
33
+ ## Note
34
+ The model's BOS token behavior was adjusted for GGUF compatibility.
35
+ This was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth)
36
  [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)