Nafifa commited on
Commit
b9791fc
·
verified ·
1 Parent(s): 8afe3bd

Training in progress, step 30

Browse files
README.md CHANGED
@@ -1,20 +1,18 @@
1
  ---
2
  base_model: ibm-granite/granite-4.0-micro
3
- datasets: HuggingFaceH4/cai-conversation-harmless-old
4
  library_name: transformers
5
  model_name: granite-4.0-micro
6
  tags:
7
  - generated_from_trainer
 
8
  - sft
9
  - trl
10
- - trackio:https://Nafifa-granite-4.0-micro.hf.space?project=huggingface&runs=Nafifa-1773132048&sidebar=collapsed
11
- - trackio
12
  licence: license
13
  ---
14
 
15
  # Model Card for granite-4.0-micro
16
 
17
- This model is a fine-tuned version of [ibm-granite/granite-4.0-micro](https://huggingface.co/ibm-granite/granite-4.0-micro) on the [HuggingFaceH4/cai-conversation-harmless-old](https://huggingface.co/datasets/HuggingFaceH4/cai-conversation-harmless-old) dataset.
18
  It has been trained using [TRL](https://github.com/huggingface/trl).
19
 
20
  ## Quick start
@@ -31,7 +29,7 @@ print(output["generated_text"])
31
  ## Training procedure
32
 
33
 
34
- [<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://Nafifa-granite-4.0-micro.hf.space?project=huggingface&runs=Nafifa-1773132048&sidebar=collapsed)
35
 
36
 
37
  This model was trained with SFT.
@@ -39,7 +37,7 @@ This model was trained with SFT.
39
  ### Framework versions
40
 
41
  - TRL: 0.29.0
42
- - Transformers: 4.57.6
43
  - Pytorch: 2.10.0+cu128
44
  - Datasets: 4.0.0
45
  - Tokenizers: 0.22.2
 
1
  ---
2
  base_model: ibm-granite/granite-4.0-micro
 
3
  library_name: transformers
4
  model_name: granite-4.0-micro
5
  tags:
6
  - generated_from_trainer
7
+ - trackio:https://Nafifa-granite-4.0-micro.hf.space?project=huggingface&runs=Nafifa-1773292479&sidebar=collapsed
8
  - sft
9
  - trl
 
 
10
  licence: license
11
  ---
12
 
13
  # Model Card for granite-4.0-micro
14
 
15
+ This model is a fine-tuned version of [ibm-granite/granite-4.0-micro](https://huggingface.co/ibm-granite/granite-4.0-micro).
16
  It has been trained using [TRL](https://github.com/huggingface/trl).
17
 
18
  ## Quick start
 
29
  ## Training procedure
30
 
31
 
32
+ [<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://Nafifa-granite-4.0-micro.hf.space?project=huggingface&runs=Nafifa-1773292479&sidebar=collapsed)
33
 
34
 
35
  This model was trained with SFT.
 
37
  ### Framework versions
38
 
39
  - TRL: 0.29.0
40
+ - Transformers: 5.0.0
41
  - Pytorch: 2.10.0+cu128
42
  - Datasets: 4.0.0
43
  - Tokenizers: 0.22.2
adapter_config.json CHANGED
@@ -32,12 +32,12 @@
32
  "rank_pattern": {},
33
  "revision": null,
34
  "target_modules": [
35
- "v_proj",
36
- "o_proj",
37
- "q_proj",
38
- "gate_proj",
39
  "k_proj",
40
  "up_proj",
 
 
 
 
41
  "down_proj"
42
  ],
43
  "target_parameters": null,
 
32
  "rank_pattern": {},
33
  "revision": null,
34
  "target_modules": [
 
 
 
 
35
  "k_proj",
36
  "up_proj",
37
+ "q_proj",
38
+ "o_proj",
39
+ "gate_proj",
40
+ "v_proj",
41
  "down_proj"
42
  ],
43
  "target_parameters": null,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5fdaf1349d13eb9ca2a779d5b45833b526836f6b779e3f76e16c6f4f07cb2220
3
  size 41986272
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a4577af5526286dd7d276119e1e7ec3481b60c5875c7308c5709860c31a4a24e
3
  size 41986272
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0aa8148c54974d382da3eac654f363d6eb5641458c63578b2effdb3f56eb3486
3
  size 5585
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a4bc849338a7a1d1de79a5c661646f5df2dbd7f0f80697f877648902fe9ba25a
3
  size 5585