Update README.md
Browse files
README.md
CHANGED
|
@@ -58,10 +58,10 @@ The model is optimized to call these mobile action functions:
|
|
| 58 |
Fine-tuned using Hugging Face [TRL (Transformer Reinforcement Learning)](https://huggingface.co/docs/trl) with the `SFTTrainer`.
|
| 59 |
|
| 60 |
**Training Configuration**:
|
| 61 |
-
- **Epochs**:
|
| 62 |
-
- **Batch size**:
|
| 63 |
-
- **Gradient accumulation steps**:
|
| 64 |
-
- **Learning rate**:
|
| 65 |
- **Scheduler**: Cosine
|
| 66 |
- **Max sequence length**: 997 tokens (based on longest example: 897 tokens)
|
| 67 |
- **Optimizer**: AdamW (fused)
|
|
|
|
| 58 |
Fine-tuned using Hugging Face [TRL (Transformer Reinforcement Learning)](https://huggingface.co/docs/trl) with the `SFTTrainer`.
|
| 59 |
|
| 60 |
**Training Configuration**:
|
| 61 |
+
- **Epochs**: 4
|
| 62 |
+
- **Batch size**: 8 per device
|
| 63 |
+
- **Gradient accumulation steps**: 4
|
| 64 |
+
- **Learning rate**: 5e-5
|
| 65 |
- **Scheduler**: Cosine
|
| 66 |
- **Max sequence length**: 997 tokens (based on longest example: 897 tokens)
|
| 67 |
- **Optimizer**: AdamW (fused)
|