jprtr commited on
Commit
e2226c1
·
verified ·
1 Parent(s): 3da8a63

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -58,10 +58,10 @@ The model is optimized to call these mobile action functions:
58
  Fine-tuned using Hugging Face [TRL (Transformer Reinforcement Learning)](https://huggingface.co/docs/trl) with the `SFTTrainer`.
59
 
60
  **Training Configuration**:
61
- - **Epochs**: 2
62
- - **Batch size**: 4 per device
63
- - **Gradient accumulation steps**: 8
64
- - **Learning rate**: 1e-5
65
  - **Scheduler**: Cosine
66
  - **Max sequence length**: 997 tokens (based on longest example: 897 tokens)
67
  - **Optimizer**: AdamW (fused)
 
58
  Fine-tuned using Hugging Face [TRL (Transformer Reinforcement Learning)](https://huggingface.co/docs/trl) with the `SFTTrainer`.
59
 
60
  **Training Configuration**:
61
+ - **Epochs**: 4
62
+ - **Batch size**: 8 per device
63
+ - **Gradient accumulation steps**: 4
64
+ - **Learning rate**: 5e-5
65
  - **Scheduler**: Cosine
66
  - **Max sequence length**: 997 tokens (based on longest example: 897 tokens)
67
  - **Optimizer**: AdamW (fused)