Commit dfe00a2 by moushi21 · verified · 1 Parent(s): ef6a347

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -33,7 +33,7 @@ This model is specialized for **ALFWorld trajectory tasks**, trained to handle m
  - **Format**: Merged Full Weights (bfloat16)
  - **Method**: LoRA fine-tuning (Merged via Unsloth `merge_and_unload`)
  - **Max sequence length**: 4096
- - **Epochs**: 1
+ - **Steps**: 600
  - **Learning rate**: 5e-07
  - **LoRA Parameters during training**: r=64, alpha=128
  - **Platform**: Trained with Unsloth
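
For reference, the hyperparameters listed in this hunk can be collected into a small config object. This is only a sketch: the class and field names (`TrainingSummary`, `max_steps`, `lora_scaling`, and so on) are hypothetical and are not taken from the training script or from Unsloth's API. The scaling factor assumes the conventional LoRA rule of `alpha / r`; the README does not say whether a rank-stabilized variant was used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingSummary:
    """Hyperparameters as listed in the README after this commit (hypothetical names)."""
    max_seq_length: int = 4096
    max_steps: int = 600          # this commit replaces the earlier "Epochs: 1" entry
    learning_rate: float = 5e-07
    lora_r: int = 64
    lora_alpha: int = 128

    @property
    def lora_scaling(self) -> float:
        # Assumption: conventional LoRA scaling, i.e. the low-rank update is
        # multiplied by alpha / r.
        return self.lora_alpha / self.lora_r


summary = TrainingSummary()
print(summary.lora_scaling)
```

With r=64 and alpha=128, the update is scaled by a factor of 2 under that assumption.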