Upload merged Qwen3-4B-Instruct-2507 model (auto-generated README)

Files changed (3)

README.md CHANGED

@@ -37,9 +37,9 @@ tool use, and recovery from errors.
 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: LoRA (full precision base)
 - Max sequence length: 4096
-- Epochs: 2
-- Learning rate: 2e-05
-- LoRA: r=32, alpha=64
+- Epochs: 3
+- Learning rate: 1e-05
+- LoRA: r=64, alpha=128
 ## Usage
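
The README change above doubles both `r` and `alpha`. As a rough sketch, assuming the conventional LoRA scaling of `alpha / r` (not a rank-stabilized variant, which the README does not mention), the effective scaling stays the same while the adapter size per adapted matrix doubles. `LoraSettings`, `adapter_params`, and the 1024 x 1024 matrix shape are hypothetical names and values used only for illustration; they are not taken from the training script.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """Hypothetical container for the hyperparameters listed in the README."""
    r: int
    alpha: int
    epochs: int
    learning_rate: float

    @property
    def scaling(self) -> float:
        # Conventional LoRA multiplies the low-rank update by alpha / r.
        return self.alpha / self.r


def adapter_params(d_in: int, d_out: int, r: int) -> int:
    # LoRA adds A (r x d_in) and B (d_out x r) for each adapted weight matrix.
    return r * (d_in + d_out)


old = LoraSettings(r=32, alpha=64, epochs=2, learning_rate=2e-05)
new = LoraSettings(r=64, alpha=128, epochs=3, learning_rate=1e-05)

if __name__ == "__main__":
    print("scaling old/new:", old.scaling, new.scaling)
    # 1024 x 1024 is an illustrative shape, not the model's real dimension.
    print("adapter params old/new:",
          adapter_params(1024, 1024, old.r),
          adapter_params(1024, 1024, new.r))
```

Under that assumption, the new run changes adapter capacity, epoch count, and learning rate, but not the magnitude applied to the low-rank update.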

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:90b18b0c301a9917eeedca80de6b6f5c973db0590fb95a77d8c2e2a27c306e23
+oid sha256:b1f19be227dd0d22ccd4b7d25baee73fb818d2deae8c3ab8a3fed42305f56320
 size 4967215360

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0f0779e009e258891248afe7eb03dbab521f36727c4fd6ce75f74c593b8c7f57
+oid sha256:2f53492550d118964921d0fc80ef7d0db7558cbdbdac7e0d1e90207b0e9d4c8a
 size 3077766632
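
Both shard diffs change only the `oid` line of a Git LFS pointer; the sizes are identical, so the hash is the only way to tell the old and new weights apart. A minimal sketch of checking a downloaded shard against its pointer follows. The helper names `parse_lfs_pointer` and `verify_file` are hypothetical, and the sketch assumes the three-line pointer layout shown in the diffs above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash both match the pointer."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte shards are never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:b1f19be227dd0d22ccd4b7d25baee73fb818d2deae8c3ab8a3fed42305f56320\n"
        "size 4967215360\n"
    )
    shard = Path("model-00001-of-00002.safetensors")
    if shard.exists():
        print("shard matches pointer:", verify_file(shard, pointer))
```

A shard left over from the previous upload would pass the size check and fail the hash check, which is why the sketch compares both.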