wan-wan commited on
Commit
5e9945b
·
verified ·
1 Parent(s): e2304d1

Upload merged Qwen3-4B-Instruct-2507 model (auto-generated README)

Browse files
README.md CHANGED
@@ -37,9 +37,9 @@ tool use, and recovery from errors.
37
  - Base model: Qwen/Qwen3-4B-Instruct-2507
38
  - Method: LoRA (full precision base)
39
  - Max sequence length: 2048
40
- - Epochs: 3
41
- - Learning rate: 8e-06
42
- - LoRA: r=128, alpha=128
43
 
44
  ## Usage
45
 
 
37
  - Base model: Qwen/Qwen3-4B-Instruct-2507
38
  - Method: LoRA (full precision base)
39
  - Max sequence length: 2048
40
+ - Epochs: 2
41
+ - Learning rate: 2e-05
42
+ - LoRA: r=32, alpha=128
43
 
44
  ## Usage
45
 
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c143e18e171e1b9acf76843224d731a2bcac1a1a2f82f697912a8f08d899a31f
3
  size 4967215360
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a3263a8ebd6247658c6c881e9e58af0858ea66a92a5852048bdfa27e837081ef
3
  size 4967215360
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d0100b47b3e17144651a9726f48d164e80cdd969054bff3590b7e49c33e844b3
3
  size 3077766632
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b4cdfbb016a98c87f99dcfc4a77fa735dad85a22043da16d40b5802e3bc5950a
3
  size 3077766632