Update README.md
Browse files
README.md
CHANGED
|
@@ -33,7 +33,7 @@ while intermediate reasoning (Chain-of-Thought) is masked.
|
|
| 33 |
|
| 34 |
- Base model: Qwen/Qwen3-4B-Instruct-2507
|
| 35 |
- Method: QLoRA (4-bit)
|
| 36 |
-
- Max sequence length:
|
| 37 |
- Epochs: 1
|
| 38 |
- Learning rate: 1e-06
|
| 39 |
- LoRA: r=64, alpha=128
|
|
|
|
| 33 |
|
| 34 |
- Base model: Qwen/Qwen3-4B-Instruct-2507
|
| 35 |
- Method: QLoRA (4-bit)
|
| 36 |
+
- Max sequence length: 768
|
| 37 |
- Epochs: 1
|
| 38 |
- Learning rate: 1e-06
|
| 39 |
- LoRA: r=64, alpha=128
|