Update README.md
README.md
CHANGED
@@ -38,7 +38,7 @@ action selection, and error recovery.
 - **Base model:** Qwen/Qwen2.5-7B-Instruct
 - **Method:** LoRA (merged post-training)
 - **Max sequence length:** 2048
-- **Epochs:**
+- **Epochs:** 4
 - **Learning rate:** 2e-06
 - **LoRA Parameters:** r=64, alpha=128
 
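
For reference, the hyperparameters listed in this hunk can be collected into a small config object. This is a minimal sketch, not the author's training script: the `TrainingConfig` class and its field names are hypothetical, and the `alpha / r` scaling is the standard LoRA convention, assumed here rather than confirmed by the README.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingConfig:
    """Hypothetical container for the values listed in the README hunk."""

    base_model: str = "Qwen/Qwen2.5-7B-Instruct"
    max_seq_length: int = 2048
    epochs: int = 4            # the value this commit fills in
    learning_rate: float = 2e-6
    lora_r: int = 64
    lora_alpha: int = 128

    @property
    def lora_scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r
        # (assumed convention; the README does not state it).
        return self.lora_alpha / self.lora_r


config = TrainingConfig()
print(config.lora_scaling)
```

Under that assumption, r=64 with alpha=128 gives a scaling factor of 2 on the low-rank update.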