hellstone1918 commited on
Commit
4663d49
·
verified ·
1 Parent(s): a14695a

Add README

Browse files
Files changed (1) hide show
  1. README.md +12 -50
README.md CHANGED
@@ -1,59 +1,21 @@
1
  ---
2
- base_model: unsloth/llama-3.2-3b-instruct-bnb-4bit
3
- library_name: transformers
4
- model_name: test-model
5
  tags:
6
- - generated_from_trainer
 
7
  - unsloth
8
- - sft
9
- - trl
10
- licence: license
11
- ---
12
-
13
- # Model Card for test-model
14
-
15
- This model is a fine-tuned version of [unsloth/llama-3.2-3b-instruct-bnb-4bit](https://huggingface.co/unsloth/llama-3.2-3b-instruct-bnb-4bit).
16
- It has been trained using [TRL](https://github.com/huggingface/trl).
17
-
18
- ## Quick start
19
-
20
- ```python
21
- from transformers import pipeline
22
-
23
- question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
24
- generator = pipeline("text-generation", model="hellstone1918/test-model", device="cuda")
25
- output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
26
- print(output["generated_text"])
27
- ```
28
 
29
- ## Training procedure
30
-
31
-
32
-
33
-
34
- This model was trained with SFT.
35
-
36
- ### Framework versions
37
 
38
- - TRL: 0.24.0
39
- - Transformers: 4.57.2
40
- - Pytorch: 2.9.0
41
- - Datasets: 4.3.0
42
- - Tokenizers: 0.22.1
43
 
44
- ## Citations
45
 
 
 
 
46
 
 
 
47
 
48
- Cite TRL as:
49
-
50
- ```bibtex
51
- @misc{vonwerra2022trl,
52
- title = {{TRL: Transformer Reinforcement Learning}},
53
- author = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
54
- year = 2020,
55
- journal = {GitHub repository},
56
- publisher = {GitHub},
57
- howpublished = {\url{https://github.com/huggingface/trl}}
58
- }
59
- ```
 
1
  ---
 
 
 
2
  tags:
3
+ - gguf
4
+ - llama.cpp
5
  - unsloth
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
 
7
+ ---
 
 
 
 
 
 
 
8
 
9
+ # test-model - GGUF
 
 
 
 
10
 
11
+ This model was finetuned and converted to GGUF format using [Unsloth](https://github.com/unslothai/unsloth).
12
 
13
+ **Example usage**:
14
+ - For text only LLMs: **llama-cli** **--hf** repo_id/model_name **-p** "why is the sky blue?"
15
+ - For multimodal models: **llama-mtmd-cli** **-m** model_name.gguf **--mmproj** mmproj_file.gguf
16
 
17
+ ## Available Model files:
18
+ - `Llama-3.2-3B-Instruct.Q4_K_M.gguf`
19
 
20
+ ## Ollama
21
+ An Ollama Modelfile is included for easy deployment.