Update README.md
Browse files
README.md
CHANGED
|
@@ -70,9 +70,9 @@ This comprehensive training helps the model address a broad spectrum of hospital
|
|
| 70 |
|
| 71 |
- **Optimizer**: AdamW
|
| 72 |
- **Learning Rate**: 0.0002 with a cosine learning rate scheduler
|
| 73 |
-
- **Epochs**:
|
| 74 |
- **Batch Size**: 4
|
| 75 |
-
- **Gradient Accumulation Steps**:
|
| 76 |
- **Maximum Sequence Length**: 8192 tokens
|
| 77 |
|
| 78 |
### Environment
|
|
|
|
| 70 |
|
| 71 |
- **Optimizer**: AdamW
|
| 72 |
- **Learning Rate**: 0.0002 with a cosine learning rate scheduler
|
| 73 |
+
- **Epochs**: 4
|
| 74 |
- **Batch Size**: 4
|
| 75 |
+
- **Gradient Accumulation Steps**: 4
|
| 76 |
- **Maximum Sequence Length**: 8192 tokens
|
| 77 |
|
| 78 |
### Environment
|