yiling24 commited on
Commit
d008b51
·
verified ·
1 Parent(s): b5eaf88

Training in progress, step 30

Browse files
Files changed (3) hide show
  1. README.md +5 -7
  2. adapter_config.json +2 -2
  3. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -1,20 +1,18 @@
1
  ---
2
  base_model: ibm-granite/granite-4.0-micro
3
- datasets: HuggingFaceH4/cai-conversation-harmless
4
  library_name: transformers
5
  model_name: granite-4.0-micro-cai
6
  tags:
7
  - generated_from_trainer
8
- - trl
9
  - sft
10
- - trackio
11
- - trackio:https://yiling24-granite-4.0-micro-cai.hf.space?project=huggingface&runs=yiling24-1773068822&sidebar=collapsed
12
  licence: license
13
  ---
14
 
15
  # Model Card for granite-4.0-micro-cai
16
 
17
- This model is a fine-tuned version of [ibm-granite/granite-4.0-micro](https://huggingface.co/ibm-granite/granite-4.0-micro) on the [HuggingFaceH4/cai-conversation-harmless](https://huggingface.co/datasets/HuggingFaceH4/cai-conversation-harmless) dataset.
18
  It has been trained using [TRL](https://github.com/huggingface/trl).
19
 
20
  ## Quick start
@@ -31,7 +29,7 @@ print(output["generated_text"])
31
  ## Training procedure
32
 
33
 
34
- [<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://yiling24-granite-4.0-micro-cai.hf.space?project=huggingface&runs=yiling24-1773068822&sidebar=collapsed)
35
 
36
 
37
  This model was trained with SFT.
@@ -39,7 +37,7 @@ This model was trained with SFT.
39
  ### Framework versions
40
 
41
  - TRL: 0.29.0
42
- - Transformers: 4.57.6
43
  - Pytorch: 2.10.0+cu128
44
  - Datasets: 4.0.0
45
  - Tokenizers: 0.22.2
 
1
  ---
2
  base_model: ibm-granite/granite-4.0-micro
 
3
  library_name: transformers
4
  model_name: granite-4.0-micro-cai
5
  tags:
6
  - generated_from_trainer
7
+ - trackio:https://yiling24-granite-4.0-micro-cai.hf.space?project=huggingface&runs=yiling24-1773112443&sidebar=collapsed
8
  - sft
9
+ - trl
 
10
  licence: license
11
  ---
12
 
13
  # Model Card for granite-4.0-micro-cai
14
 
15
+ This model is a fine-tuned version of [ibm-granite/granite-4.0-micro](https://huggingface.co/ibm-granite/granite-4.0-micro).
16
  It has been trained using [TRL](https://github.com/huggingface/trl).
17
 
18
  ## Quick start
 
29
  ## Training procedure
30
 
31
 
32
+ [<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://yiling24-granite-4.0-micro-cai.hf.space?project=huggingface&runs=yiling24-1773112443&sidebar=collapsed)
33
 
34
 
35
  This model was trained with SFT.
 
37
  ### Framework versions
38
 
39
  - TRL: 0.29.0
40
+ - Transformers: 5.0.0
41
  - Pytorch: 2.10.0+cu128
42
  - Datasets: 4.0.0
43
  - Tokenizers: 0.22.2
adapter_config.json CHANGED
@@ -33,11 +33,11 @@
33
  "revision": null,
34
  "target_modules": [
35
  "k_proj",
36
- "gate_proj",
37
  "down_proj",
 
 
38
  "up_proj",
39
  "q_proj",
40
- "o_proj",
41
  "v_proj"
42
  ],
43
  "target_parameters": null,
 
33
  "revision": null,
34
  "target_modules": [
35
  "k_proj",
 
36
  "down_proj",
37
+ "o_proj",
38
+ "gate_proj",
39
  "up_proj",
40
  "q_proj",
 
41
  "v_proj"
42
  ],
43
  "target_parameters": null,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2f3bf6a641844cd096fd8ad77fa52d6db76da726569604dcf1b5bf61d8664166
3
  size 41986272
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e3ca64522938a98b65d3c0a74cc8e3609b69973fa30102640b6211e88b7dc657
3
  size 41986272