Update training progress and artifacts
Browse files- README.md +5 -5
- model.safetensors +1 -1
README.md
CHANGED
|
@@ -19,13 +19,13 @@ The model files (merged weights and tokenizer) are stored at the root of this re
|
|
| 19 |
- **Diverse Datasets:** The script iterates through datasets available on the Hugging Face Hub, attempting to train on each.
|
| 20 |
- **Short Training Iterations:** Each training run per dataset configuration is currently set to a small number of steps (`max_steps=1`) to allow for rapid iteration across many datasets.
|
| 21 |
## Training Progress
|
| 22 |
-
- **Datasets Processed (Successfully trained on at least one config):**
|
| 23 |
-
- **Text Examples Streamed (Total):**
|
| 24 |
-
- **Tokens Processed (Total):**
|
| 25 |
-
- **Last Successful Model Update:** 2025-05-07 14:
|
| 26 |
### Evaluation Metrics
|
| 27 |
|
| 28 |
-
- **Overall Perplexity (on a small fixed dataset):**
|
| 29 |
|
| 30 |
#### Generated Examples (Qualitative Assessment)
|
| 31 |
|
|
|
|
| 19 |
- **Diverse Datasets:** The script iterates through datasets available on the Hugging Face Hub, attempting to train on each.
|
| 20 |
- **Short Training Iterations:** Each training run per dataset configuration is currently set to a small number of steps (`max_steps=1`) to allow for rapid iteration across many datasets.
|
| 21 |
## Training Progress
|
| 22 |
+
- **Datasets Processed (Successfully trained on at least one config):** 8
|
| 23 |
+
- **Text Examples Streamed (Total):** 48
|
| 24 |
+
- **Tokens Processed (Total):** 24576
|
| 25 |
+
- **Last Successful Model Update:** 2025-05-07 14:27:09 UTC
|
| 26 |
### Evaluation Metrics
|
| 27 |
|
| 28 |
+
- **Overall Perplexity (on a small fixed dataset):** 143114.84
|
| 29 |
|
| 30 |
#### Generated Examples (Qualitative Assessment)
|
| 31 |
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 80000008
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:39cb03b5008417eff14aa3cb8196faf959572a7ab871ce654172bd059ee0193d
|
| 3 |
size 80000008
|