Update README.md
README.md
CHANGED
@@ -71,12 +71,12 @@ Fine-tuned using Hugging Face [TRL (Transformer Reinforcement Learning)](https:/
 
 **Training Infrastructure**:
 - **Hardware**: Google Colab A100 GPU
-- **Training time**:
+- **Training time**: ~~60 minutes for 4 epochs
 - **Library versions**: transformers==4.57.1, trl==0.25.1, datasets==4.4.1
 
 ### Training Results
 
-Final metrics after
+Final metrics after 4 epochs:
 
 | Step | Training Loss | Validation Loss | Mean Token Accuracy |
 |------|---------------|-----------------|---------------------|