Update README.md
Browse files
README.md
CHANGED
|
@@ -33,7 +33,7 @@ This model is specialized for **ALFWorld trajectory tasks**, trained to handle m
|
|
| 33 |
- **Format**: Merged Full Weights (bfloat16)
|
| 34 |
- **Method**: LoRA fine-tuning (Merged via Unsloth `merge_and_unload`)
|
| 35 |
- **Max sequence length**: 4096
|
| 36 |
-
- **
|
| 37 |
- **Learning rate**: 5e-07
|
| 38 |
- **LoRA Parameters during training**: r=64, alpha=128
|
| 39 |
- **Platform**: Trained with Unsloth
|
|
|
|
| 33 |
- **Format**: Merged Full Weights (bfloat16)
|
| 34 |
- **Method**: LoRA fine-tuning (Merged via Unsloth `merge_and_unload`)
|
| 35 |
- **Max sequence length**: 4096
|
| 36 |
+
- **Steps**: 600
|
| 37 |
- **Learning rate**: 5e-07
|
| 38 |
- **LoRA Parameters during training**: r=64, alpha=128
|
| 39 |
- **Platform**: Trained with Unsloth
|