Barry661 commited on
Commit
e6e3d90
·
verified ·
1 Parent(s): f2ccd53

Training in progress, step 30

Browse files
Files changed (3) hide show
  1. README.md +3 -5
  2. adapter_config.json +4 -4
  3. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -1,20 +1,18 @@
1
  ---
2
  base_model: Qwen/Qwen2.5-7B-Instruct
3
- datasets: HuggingFaceH4/deita-10k-v0-sft
4
  library_name: transformers
5
  model_name: Qwen2.5-7B-Instruct
6
  tags:
7
  - generated_from_trainer
8
  - sft
 
9
  - trl
10
- - trackio
11
- - trackio:https://Barry661-Qwen2.5-7B-Instruct.hf.space?project=huggingface&runs=Barry661-1773039020&sidebar=collapsed
12
  licence: license
13
  ---
14
 
15
  # Model Card for Qwen2.5-7B-Instruct
16
 
17
- This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) on the [HuggingFaceH4/deita-10k-v0-sft](https://huggingface.co/datasets/HuggingFaceH4/deita-10k-v0-sft) dataset.
18
  It has been trained using [TRL](https://github.com/huggingface/trl).
19
 
20
  ## Quick start
@@ -31,7 +29,7 @@ print(output["generated_text"])
31
  ## Training procedure
32
 
33
 
34
- [<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://Barry661-Qwen2.5-7B-Instruct.hf.space?project=huggingface&runs=Barry661-1773039020&sidebar=collapsed)
35
 
36
 
37
  This model was trained with SFT.
 
1
  ---
2
  base_model: Qwen/Qwen2.5-7B-Instruct
 
3
  library_name: transformers
4
  model_name: Qwen2.5-7B-Instruct
5
  tags:
6
  - generated_from_trainer
7
  - sft
8
+ - trackio:https://Barry661-Qwen2.5-7B-Instruct.hf.space?project=huggingface&runs=Barry661-1773112846&sidebar=collapsed
9
  - trl
 
 
10
  licence: license
11
  ---
12
 
13
  # Model Card for Qwen2.5-7B-Instruct
14
 
15
+ This model is a fine-tuned version of [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct).
16
  It has been trained using [TRL](https://github.com/huggingface/trl).
17
 
18
  ## Quick start
 
29
  ## Training procedure
30
 
31
 
32
+ [<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://Barry661-Qwen2.5-7B-Instruct.hf.space?project=huggingface&runs=Barry661-1773112846&sidebar=collapsed)
33
 
34
 
35
  This model was trained with SFT.
adapter_config.json CHANGED
@@ -32,13 +32,13 @@
32
  "rank_pattern": {},
33
  "revision": null,
34
  "target_modules": [
 
 
35
  "k_proj",
36
  "up_proj",
37
- "down_proj",
38
- "q_proj",
39
  "o_proj",
40
- "v_proj",
41
- "gate_proj"
42
  ],
43
  "target_parameters": null,
44
  "task_type": null,
 
32
  "rank_pattern": {},
33
  "revision": null,
34
  "target_modules": [
35
+ "v_proj",
36
+ "gate_proj",
37
  "k_proj",
38
  "up_proj",
 
 
39
  "o_proj",
40
+ "down_proj",
41
+ "q_proj"
42
  ],
43
  "target_parameters": null,
44
  "task_type": null,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e55ae8371fc0f6d304843e3033d619cc7822e1672a9ca8b05620e15355320934
3
  size 161533584
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8c4717821557f8e0e634a5c4d568632ba895c2abd56bbd71b5de75c4e2169603
3
  size 161533584