rokugatsu committed on
Commit becdcc2 · verified · 1 Parent(s): 13819ed

Upload merged Qwen3-4B-Instruct-2507 model (auto-generated README)

README.md CHANGED
@@ -2,6 +2,9 @@
 base_model: Qwen/Qwen3-4B-Instruct-2507
 datasets:
 - u-10bei/sft_alfworld_trajectory_dataset_v5
+- u-10bei/sft_alfworld_trajectory_dataset_v5
+- u-10bei/sft_alfworld_trajectory_dataset_v5
+- u-10bei/sft_alfworld_trajectory_dataset_v5
 language:
 - en
 license: apache-2.0
@@ -37,7 +40,7 @@ tool use, and recovery from errors.
 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: LoRA (full precision base)
 - Max sequence length: 2048
-- Epochs: 2
+- Epochs: 1
 - Learning rate: 2e-06
 - LoRA: r=64, alpha=128
 
@@ -62,7 +65,7 @@ model = PeftModel.from_pretrained(model, adapter)
 
 ## Sources & Terms (IMPORTANT)
 
-Training data: u-10bei/sft_alfworld_trajectory_dataset_v5
+Training data: u-10bei/sft_alfworld_trajectory_dataset_v5, u-10bei/sft_alfworld_trajectory_dataset_v5, u-10bei/sft_alfworld_trajectory_dataset_v5, u-10bei/sft_alfworld_trajectory_dataset_v5
 
 Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
 Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.
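
Note on the README.md change: after this commit the same dataset ID appears four times in both the `datasets:` metadata and the "Training data" line. The commit does not say whether that is intended. If it is a side effect of the README generator joining one entry per training run, an order-preserving de-duplication step might look like the sketch below. `dedupe_datasets` is a hypothetical helper name, not something from the author's script.

```python
def dedupe_datasets(names):
    """Drop repeated dataset IDs while keeping first-seen order.

    Hypothetical helper for illustration; relies on dict preserving
    insertion order (guaranteed since Python 3.7).
    """
    return list(dict.fromkeys(names))


# The four identical entries found in the new README metadata.
datasets = ["u-10bei/sft_alfworld_trajectory_dataset_v5"] * 4
unique = dedupe_datasets(datasets)

# Rebuild the "Training data" line from the de-duplicated list.
training_data_line = "Training data: " + ", ".join(unique)
print(training_data_line)
```

With a single unique ID the rebuilt line matches the pre-commit wording of the "Training data" entry.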
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4868a8181590aab553c3c7381108ef454404793d2afd398521da13a2b7b1971f
+oid sha256:5ee5f03681cb9ef2b5012971d08ffee2d49ae565ab6ce97170a21481682df5d6
 size 4967215360
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e4667c95dabfc3c849d16235b51a92c1db6cd48b8f317779e754bf44eb6507b3
+oid sha256:f6f754e470240e299735ce7703b1e9870922f6c973dcbf79465c62ec3fb37f31
 size 3077766632
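
Note on the `.safetensors` changes: the diffs above touch Git LFS pointer files, not the weights themselves. Each pointer is three `key value` lines (`version`, `oid`, `size`). In both shards only the `oid` hash changes while `size` stays the same, which is what one would expect if the tensors keep their shapes and only their values differ. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not a function from any Git LFS library.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a small dict.

    Hypothetical helper: assumes one 'key value' pair per line and an
    oid of the form '<algorithm>:<hex digest>'.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# New pointer for model-00002-of-00002.safetensors, copied from the diff.
new_pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f6f754e470240e299735ce7703b1e9870922f6c973dcbf79465c62ec3fb37f31
size 3077766632
"""

info = parse_lfs_pointer(new_pointer)
print(info["hash_algo"], info["size"])
```

Comparing the parsed `digest` of the old and new pointers is a cheap way to confirm that a shard was actually re-uploaded without downloading it.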