naru0411 commited on
Commit
73f991c
·
verified ·
1 Parent(s): a90ea36

Upload merged Qwen3-4B-Instruct-2507 model (auto-generated README)

Browse files
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  base_model: Qwen/Qwen3-4B-Instruct-2507
3
  datasets:
4
- - alfworld_full_success_v2.jsonl
5
  language:
6
  - en
7
  license: apache-2.0
@@ -38,8 +38,8 @@ tool use, and recovery from errors.
38
  - Method: LoRA (full precision base)
39
  - Max sequence length: 4096
40
  - Epochs: 2
41
- - Learning rate: 1e-06
42
- - LoRA: r=32, alpha=64
43
 
44
  ## Usage
45
 
@@ -62,7 +62,7 @@ model = PeftModel.from_pretrained(model, adapter)
62
 
63
  ## Sources & Terms (IMPORTANT)
64
 
65
- Training data: alfworld_full_success_v2.jsonl
66
 
67
  Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
68
  Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.
 
1
  ---
2
  base_model: Qwen/Qwen3-4B-Instruct-2507
3
  datasets:
4
+ - alfworld_cleaned_no_upsampling.jsonl
5
  language:
6
  - en
7
  license: apache-2.0
 
38
  - Method: LoRA (full precision base)
39
  - Max sequence length: 4096
40
  - Epochs: 2
41
+ - Learning rate: 2e-06
42
+ - LoRA: r=64, alpha=128
43
 
44
  ## Usage
45
 
 
62
 
63
  ## Sources & Terms (IMPORTANT)
64
 
65
+ Training data: alfworld_cleaned_no_upsampling.jsonl
66
 
67
  Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
68
  Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:65c584ede773adab21326b84b8200ce44e0afb3718f9fcd2d2d4d69cdab96eab
3
  size 4967215360
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:56026c0b7a37d4e3801ce44e92a70a77f32fe3b3e7b23e34ce046936364be8f2
3
  size 4967215360
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:debc4b7fb241a79a01bafa6d5536ab1aa434de74856e17f757947614d5cb9914
3
  size 3077766632
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:36cec146becb3fdfebdda15469cad6d4717766220d4af6cbc33ae2394f5f4c6b
3
  size 3077766632