Upload merged Qwen3-4B-Instruct-2507 model (auto-generated README)

Files changed (3) hide show

README.md CHANGED Viewed

@@ -37,9 +37,9 @@ tool use, and recovery from errors.
 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: LoRA (full precision base)
 - Max sequence length: 2048
-- Epochs: 3
-- Learning rate: 8e-06
-- LoRA: r=128, alpha=128
 ## Usage

 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: LoRA (full precision base)
 - Max sequence length: 2048
+- Epochs: 2
+- Learning rate: 2e-05
+- LoRA: r=32, alpha=128
 ## Usage

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c143e18e171e1b9acf76843224d731a2bcac1a1a2f82f697912a8f08d899a31f
 size 4967215360

 version https://git-lfs.github.com/spec/v1
+oid sha256:a3263a8ebd6247658c6c881e9e58af0858ea66a92a5852048bdfa27e837081ef
 size 4967215360

model-00002-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d0100b47b3e17144651a9726f48d164e80cdd969054bff3590b7e49c33e844b3
 size 3077766632

 version https://git-lfs.github.com/spec/v1
+oid sha256:b4cdfbb016a98c87f99dcfc4a77fa735dad85a22043da16d40b5802e3bc5950a
 size 3077766632