Update training progress and artifacts
- README.md +5 -5
- model.safetensors +1 -1
README.md
CHANGED
@@ -19,13 +19,13 @@ The model files (merged weights and tokenizer) are stored at the root of this re
 - **Diverse Datasets:** The script iterates through datasets available on the Hugging Face Hub, attempting to train on each.
 - **Short Training Iterations:** Each training run per dataset configuration is currently set to a small number of steps (`max_steps=1`) to allow for rapid iteration across many datasets.
 ## Training Progress
-- **Datasets Processed (Successfully trained on at least one config):**
-- **Text Examples Streamed (Total):**
-- **Tokens Processed (Total):**
-- **Last Successful Model Update:** 2025-05-07 14:
+- **Datasets Processed (Successfully trained on at least one config):** 9
+- **Text Examples Streamed (Total):** 54
+- **Tokens Processed (Total):** 27648
+- **Last Successful Model Update:** 2025-05-07 14:28:37 UTC
 ### Evaluation Metrics
 
-- **Overall Perplexity (on a small fixed dataset):**
+- **Overall Perplexity (on a small fixed dataset):** 143030.67
 
 #### Generated Examples (Qualitative Assessment)
 
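As a reading aid for the numbers in the README hunk above: perplexity is conventionally the exponential of the mean per-token cross-entropy loss in nats. The sketch below assumes that convention applies here; the evaluation script is not part of this diff, and the function names are hypothetical. It also works out the per-example and per-dataset averages implied by the reported totals.

```python
import math

def perplexity_from_loss(mean_nll_nats: float) -> float:
    """Perplexity as exp of the mean per-token negative log-likelihood (nats)."""
    return math.exp(mean_nll_nats)

def loss_from_perplexity(perplexity: float) -> float:
    """Inverse of the above: the mean per-token loss implied by a perplexity."""
    return math.log(perplexity)

# Values copied from the updated README lines in this commit.
datasets_processed = 9
examples_streamed = 54
tokens_processed = 27648
reported_perplexity = 143030.67

# Assumption: the reported figure follows the exp(mean loss) convention.
implied_loss = loss_from_perplexity(reported_perplexity)

# Simple averages derived from the reported totals.
tokens_per_example = tokens_processed / examples_streamed
examples_per_dataset = examples_streamed / datasets_processed

print(f"implied mean loss: {implied_loss:.2f} nats/token")
print(f"tokens per example: {tokens_per_example}")
print(f"examples per dataset: {examples_per_dataset}")
```

Under that assumption the implied loss is a little under 12 nats per token, which is what one would expect from a model trained for only a handful of steps. The token total divides evenly into 512 tokens per example, which would be consistent with a fixed sequence length, although the diff does not state one.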
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:e635dfb0e375a4055d47994b1bcd0f76dc8bc7c8f4c7ae490bc79f0ae9b283eb
 size 80000008
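The `model.safetensors` change above only touches the Git LFS pointer, which records the SHA-256 digest and byte size of the real file. A minimal sketch of checking a downloaded file against such a pointer follows; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library, and the code assumes the pointer contains only the three keys shown in this diff.

```python
import hashlib
from pathlib import Path

# Pointer text as it appears on the "+" side of this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e635dfb0e375a4055d47994b1bcd0f76dc8bc7c8f4c7ae490bc79f0ae9b283eb
size 80000008
"""

def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first, avoids hashing a wrong-sized file
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in chunks so a large weights file is never fully loaded in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]

pointer = parse_lfs_pointer(POINTER_TEXT)
```

With a local copy of the weights, `matches_pointer("model.safetensors", pointer)` would report whether the file corresponds to this commit. Note that the size is unchanged between the two sides of the diff, so only the digest distinguishes the old weights from the new ones.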