chronobcelp committed
Commit 570b64e · verified · 1 Parent(s): a4e7a20

Upload merged model (patched README metadata)

README.md CHANGED
@@ -1,7 +1,5 @@
  ---
  base_model: Qwen/Qwen3-4B-Instruct-2507
- datasets:
- - u-10bei/dbbench_sft_dataset_react_v4
  language:
  - en
  license: apache-2.0
@@ -37,9 +35,14 @@ tool use, and recovery from errors.
  - Base model: Qwen/Qwen3-4B-Instruct-2507
  - Method: LoRA (full precision base)
  - Max sequence length: 4096
- - Epochs: 2
- - Learning rate: 1e-05
+ - Epochs: 3
+ - Learning rate: 8e-06
  - LoRA: r=64, alpha=128
+ - warmup_ratio : 0
+ - weight_decay : 0
+ - lora_dropout : 0
+ - lora_target_modules :['q_proj', 'k_proj', 'v_proj', 'o_proj', 'gate_proj', 'up_proj', 'down_proj']
+ - grad_accum : 8
 
  ## Usage
 
@@ -62,7 +65,7 @@ model = PeftModel.from_pretrained(model, adapter)
 
  ## Sources & Terms (IMPORTANT)
 
- Training data: u-10bei/dbbench_sft_dataset_react_v4
+ Training data: /content/dbbench_react_merged_dedup
 
  Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
  Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.
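
For reference, the hyperparameters added to the README in this commit can be collected in one place. The following is a minimal sketch, not the author's training script: the dictionary name `README_HPARAMS` and the helpers `lora_scaling` and `build_lora_config` are hypothetical, and `task_type="CAUSAL_LM"` is an assumption since the README does not state it. Only the numeric values and module names come from the diff above.

```python
# Values copied from the updated README; all names below are illustrative.
README_HPARAMS = {
    "epochs": 3,
    "learning_rate": 8e-06,
    "lora_r": 64,
    "lora_alpha": 128,
    "lora_dropout": 0.0,
    "warmup_ratio": 0.0,
    "weight_decay": 0.0,
    "grad_accum": 8,
    "lora_target_modules": [
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
}


def lora_scaling(r: int, alpha: int) -> float:
    """Standard LoRA scales the low-rank update by alpha / r."""
    return alpha / r


def build_lora_config(hp: dict):
    """Return a peft.LoraConfig built from hp, or None if peft is missing.

    task_type is an assumption; the README does not specify it.
    """
    try:
        from peft import LoraConfig
    except ImportError:
        return None
    return LoraConfig(
        r=hp["lora_r"],
        lora_alpha=hp["lora_alpha"],
        lora_dropout=hp["lora_dropout"],
        target_modules=hp["lora_target_modules"],
        task_type="CAUSAL_LM",
    )
```

With `r=64` and `alpha=128`, `lora_scaling(64, 128)` gives a scaling factor of 2.0 under the standard (non rank-stabilized) LoRA formulation. Calling `build_lora_config(README_HPARAMS)` would produce an adapter config targeting all seven attention and MLP projections listed in the README.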
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f79c7176b2de27711dda2731ca6519e0f477f5e375a4d442dac0a2c76629a8bf
+ oid sha256:8f9fa07dcd0661f7c5f32959c5728f56dd836c79ab33c28c41ce7aab1daea561
  size 4967215360
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1d8774f1e2e2dec4fc7b4287b49222d750d307f9ce63aad99a7bcc56d27ea0f1
+ oid sha256:3b76935d81bc551e770592efe411075ba84b8b2f5c232fe638d9f69c706738db
  size 3077766632
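
Both shard diffs change only the Git LFS pointer: the `oid` differs while `size` stays the same, so the byte count alone cannot tell the old and new weights apart. A downloaded shard could be checked against its pointer roughly as follows. This is a sketch under stated assumptions: the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is the new-side content of `model-00001-of-00002.safetensors` from this commit.

```python
import hashlib
from pathlib import Path

# New-side pointer for model-00001-of-00002.safetensors, copied from the diff.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:8f9fa07dcd0661f7c5f32959c5728f56dd836c79ab33c28c41ce7aab1daea561
size 4967215360
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at path has the pointer's size and hash."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte shards do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, `matches_pointer("model-00001-of-00002.safetensors", parse_lfs_pointer(POINTER))` would return `True` only for the shard uploaded in this commit, and `False` for the previous shard even though both are 4967215360 bytes.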