YAI777 committed · Commit 7598143 · verified · 1 Parent(s): a4ced58

Upload merged Qwen3-4B-Instruct-2507 model (auto-generated README)

README.md CHANGED
@@ -1,7 +1,7 @@
 ---
 base_model: Qwen/Qwen3-4B-Instruct-2507
 datasets:
-- u-10bei/dbbench_sft_dataset_react_v4
+- u-10bei/sft_alfworld_trajectory_dataset_v5
 language:
 - en
 license: apache-2.0
@@ -14,6 +14,9 @@ tags:
 - alfworld
 - dbbench
 ---
+
+# <[Assignment] Please fill this in yourself>
+
 This repository provides a **LoRA adapter** fine-tuned from
 **Qwen/Qwen3-4B-Instruct-2507** using **LoRA + Unsloth**.
 
@@ -33,9 +36,9 @@ tool use, and recovery from errors.
 
 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: LoRA (full precision base)
-- Max sequence length: 8192
+- Max sequence length: 8184
 - Epochs: 2
-- Learning rate: 1e-06
+- Learning rate: 1e-05
 - LoRA: r=64, alpha=128
 
 ## Usage
@@ -59,7 +62,7 @@ model = PeftModel.from_pretrained(model, adapter)
 
 ## Sources & Terms (IMPORTANT)
 
-Training data: u-10bei/dbbench_sft_dataset_react_v4
+Training data: u-10bei/sft_alfworld_trajectory_dataset_v5
 
 Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
 Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.
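
The hyperparameter lines touched in the README diff above can be compared side by side. The following is a minimal sketch, assuming the standard LoRA convention that the low-rank update is scaled by `alpha / r`. The `LoraHyperparams` class and its method names are hypothetical, introduced only for illustration, and the layer dimensions in the usage note are placeholders, not values taken from the model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraHyperparams:
    """Hypothetical container for the values listed in the README."""
    r: int
    alpha: int
    learning_rate: float
    max_seq_length: int
    epochs: int

    @property
    def scaling(self) -> float:
        # Standard LoRA scales the low-rank update B @ A by alpha / r.
        return self.alpha / self.r

    def added_params(self, d_in: int, d_out: int) -> int:
        # One adapted linear layer adds A (r x d_in) and B (d_out x r).
        return self.r * (d_in + d_out)


# Values before and after this commit, copied from the README diff.
old = LoraHyperparams(r=64, alpha=128, learning_rate=1e-06,
                      max_seq_length=8192, epochs=2)
new = LoraHyperparams(r=64, alpha=128, learning_rate=1e-05,
                      max_seq_length=8184, epochs=2)

lr_ratio = new.learning_rate / old.learning_rate
seq_len_delta = new.max_seq_length - old.max_seq_length
```

With these values, `new.scaling` is unchanged by the commit, `lr_ratio` shows the learning rate rising by roughly a factor of ten, and `seq_len_delta` shows the sequence length shrinking by 8 tokens. Calling `new.added_params(d_in, d_out)` with a layer's actual dimensions gives the number of trainable weights LoRA adds for that layer.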
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:915460d174261b07c02cde5c472face0b8a6c862cc4da7e932518927e46b9b90
+oid sha256:65021d8df9e20112911792445a51df55552a6c78c291a13d3a36cc17b3c0ad07
 size 4967215360
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3ac1132c1e598a60b035bec7f21f480a1d42b2609b551fd28b3162b16003d842
+oid sha256:12d7d21468897595da5f1c259789ce3b237ecbd0642d7c11243decd727a67031
 size 3077766632
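
In both `.safetensors` diffs above, only the `oid` line of the Git LFS pointer changes while `size` stays the same, so the shards were replaced by files of identical byte length but different content. The sketch below parses such a pointer and checks a downloaded file against it. It assumes the three-line `version` / `oid` / `size` layout shown in this diff and a SHA-256 `oid`; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file has the size and SHA-256 digest in `pointer`."""
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as fh:
        # Read in chunks so multi-gigabyte shards are not loaded into memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Old and new pointers for model-00001-of-00002.safetensors, from the diff.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:915460d174261b07c02cde5c472face0b8a6c862cc4da7e932518927e46b9b90
size 4967215360
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:65021d8df9e20112911792445a51df55552a6c78c291a13d3a36cc17b3c0ad07
size 4967215360
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
weights_changed = old["digest"] != new["digest"]
same_size = old["size"] == new["size"]
```

After downloading a shard, `matches_pointer(Path("model-00001-of-00002.safetensors"), new)` would indicate whether the local file corresponds to this commit rather than its parent.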